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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..966 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG AAT AAC ATC GGC ATT ACT GTT TAT GGA TGT GAG CAG GAT GAG GCA 
Met Asn Asn lie Gly lie Thr Val Tyr Gly Cys Glu Gin Asp Glu Ala 
15 10 15 

GAT GCA TTC CAT GCT CTT TCG CCT CGC TTT GGC GTT ATG GCA ACG ATA 
Asp Ala Phe His Ala Leu Ser Pro Arg Phe Gly Val Met Ala Thr lie 
20 25 30 

ATT AAC GCC AAC GTG TCG GAA TCC AAC GCC AAA TCC GCG CCT TTC AAT 
lie Asn Ala Asn Val Ser Glu Ser Asn Ala Lys Ser Ala Pro Phe Asn 
35 40 45 

CAA TGT ATC AGT GTG GGA CAT AAA TCA GAG ATT TCC GCC TCT ATT CTT 
Gin Cys lie Ser Val Gly His Lys Ser Glu lie Ser Ala Ser lie Leu 
50 55 60 

CTT GCG CTG AAG AGA GCC GGT GTG AAA TAT ATT TCT ACC CGA AGC ATC 
Leu Ala Leu Lys Arg Ala Gly Val Lys Tyr lie Ser Thr Arg Ser lie 
65 70 75 80 

GGC TGC AAT CAT ATA GAT ACA ACT GCT GCT AAG AGA ATG GGC ATC ACT 
Gly Cys Asn His lie Asp Thr Thr Ala Ala Lys Arg Met Gly lie Thr 

85 90 95 



GTC GAC AAT GTG GCG TAC TCG CCG GAT AGC GTT GCC GAT TAT ACT ATG 
Val Asp Asn Val Ala Tyr Ser Pro Asp Ser Val Ala Asp Tyr Thr Met 
100 105 110 



ATG CTA ATT CTT ATG GCA GTA CGC AAC GTA AAA TCG ATT GTG CGC TCT 
Met Leu lie Leu Met Ala Val Arg Asn Val Lys Ser lie Val Arg Ser 
115 120 125 



GTG GAA AAA CAT GAT TTC AGG TTG GAC AGC GAC CGT GGC AAG GTA CTC 
Val Glu Lys His Asp Phe Arg Leu Asp Ser Asp Arg Gly Lys Val Leu 
130 135 140 

AGC GAC ATG ACA GTT GGT GTG GTG GGA ACG GGC CAG ATA GGC AAA GCG 
Ser Asp Met Thr Val Gly Val Val Gly Thr Gly Gin lie Gly Lys Ala 
145 150 155 160 

GTT ATT GAG CGG CTG CGA GGA TTT GGA TGT AAA GTG TTG GCT TAT AGT 
Val lie Glu Arg Leu Arg Gly Phe Gly Cys Lys Val Leu Ala Tyr Ser 

165 170 175 

CGC AGC CGA AGT ATA GAG GTA AAC TAT GTA CCG TTT GAT GAG TTG CTG 
Arg Ser Arg Ser lie Glu Val Asn Tyr Val Pro Phe Asp Glu Leu Leu 
180 185 190 

CAA AAT AGC GAT ATC GTT ACG CTT CAT GTG CCG CTC AAT ACG GAT ACG 
Gin Asn Ser Asp lie Val Thr Leu His Val Pro Leu Asn Thr Asp Thr 
195 200 205 

CAC TAT ATT ATC AGC CAC GAA CAA ATA CAG AGA ATG AAG CAA GGA GCA 
His Tyr lie lie Ser His Glu Gin lie Gin Arg Met Lys Gin Gly Ala 
210 215 220 

TTT CTT ATC AAT ACT GGG CGC GGT CCA CTT GTA GAT ACC TAT GAG TTG 
Phe Leu lie Asn Thr Gly Arg Gly Pro Leu Val Asp Thr Tyr Glu Leu 
225 230 235 240 

GTT AAA GCA TTA GAA AAC GGG AAA CTG GGC GGT GCC GCA TTG GAT GTA 
Val Lys Ala Leu Glu Asn Gly Lys Leu Gly Gly Ala Ala Leu Asp Val 

245 250 255 

TTG GAA GGA GAG GAA GAG TTT TTC TAC TCT GAT TGC ACC CAA AAA CCA 
Leu Glu Gly Glu Glu Glu Phe Phe Tyr Ser Asp Cys Thr Gin Lys Pro 
260 265 270 

ATT GAT AAT CAA TTT TTA CTT AAA CTT CAA AGA ATG CCT AAC GTG ATA 
lie Asp Asn Gin Phe Leu Leu Lys Leu Gin Arg Met Pro Asn Val lie 
275 280 285 

ATC ACA CCG CAT ACG GCC TAT TAT ACC GAG CAA GCG TTG CGT GAT ACC 
lie Thr Pro His Thr Ala Tyr Tyr Thr Glu Gin Ala Leu Arg Asp Thr 
290 295 300 

GTT GAA AAA ACC ATT AAA AAC TGT TTG GAT TTT GAA AGG AGA CAG GAG 
Val Glu Lys Thr lie Lys Asn Cys Leu Asp Phe Glu Arg Arg Gin Glu 
305 310 315 320 

CAT GAA 
His Glu 



* (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asn Asn lie Gly lie Thr Val Tyr Gly Cys Glu Gin Asp Glu Ala 
15 10 15 

Asp Ala Phe His Ala Leu Ser Pro Arg Phe Gly Val Met Ala Thr lie 
20 25 30 

lie Asn Ala Asn Val Ser Glu Ser Asn Ala Lys Ser Ala Pro Phe Asn 
35 40 45 

Gin Cys lie Ser Val Gly His Lys Ser Glu lie Ser Ala Ser lie Leu 
50 55 60 

Leu Ala Leu Lys Arg Ala Gly Val Lys Tyr lie Ser Thr Arg Ser lie 
65 70 75 80 

Gly Cys Asn His lie Asp Thr Thr Ala Ala Lys Arg Met Gly lie Thr 

85 90 95 

Val Asp Asn Val Ala Tyr Ser Pro Asp Ser Val Ala Asp Tyr Thr Met 
100 105 110 

Met Leu lie Leu Met Ala Val Arg Asn Val Lys Ser lie Val Arg Ser 
115 120 125 

Val Glu Lys His Asp Phe Arg Leu Asp Ser Asp Arg Gly Lys Val Leu 
130 135 140 

Ser Asp Met Thr Val Gly Val Val Gly Thr Gly Gin lie Gly Lys Ala 
145 150 155 160 

Val lie Glu Arg Leu Arg Gly Phe Gly Cys Lys Val Leu Ala Tyr Ser 

165 170 175 

Arg Ser Arg Ser lie Glu Val Asn Tyr Val Pro Phe Asp Glu Leu Leu 
180 185 190 

Gin Asn Ser Asp lie Val Thr Leu His Val Pro Leu Asn Thr Asp Thr 
195 200 205 

His Tyr lie lie Ser His Glu Gin lie Gin Arg Met Lys Gin Gly Ala 
210 215 220 

Phe Leu lie Asn Thr Gly Arg Gly Pro Leu Val Asp Thr Tyr Glu Leu 
225 230 235 240 

Val Lys Ala Leu Glu Asn Gly Lys Leu Gly Gly Ala Ala Leu Asp Val 

245 250 255 



Leu Glu Gly Glu 
260 

lie Asp Asn Gin 
275 

lie Thr Pro His 
290 

Val Glu Lys Thr 
305 

His Glu 



Glu Glu Phe Phe 



Phe Leu Leu Lys 
280 

Thr Ala Tyr Tyr 
295 

lie Lys Asn Cys 
310 



Tyr Ser Asp Cys 
265 

Leu Gin Arg Met 



Thr Glu Gin Ala 
300 

Leu Asp Phe Glu 
315 



Thr Gin Lys Pro 
270 

Pro Asn Val lie 
285 

Leu Arg Asp Thr 



Arg Arg Gin Glu 
320 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG AAT AGA ATA AAA GTT GCA ATA CTG TTT GGG GGT TGC TCA GAG GAG 48 
Met Asn Arg lie Lys Val Ala lie Leu Phe Gly Gly Cys Ser Glu Glu 
1 5 10 15 

CAT GAC GTA TCG GTA AAA TCT GCA ATA GAG ATA GCC GCT AAC ATT AAT 9 6 

His Asp Val Ser Val Lys Ser Ala lie Glu lie Ala Ala Asn lie Asn 
20 25 30 

AAA GAA AAA TAC GAG CCG TTA TAC ATT GGA ATT ACG AAA TCT GGT GTA 144 
Lys Glu Lys Tyr Glu Pro Leu Tyr lie Gly lie Thr Lys Ser Gly Val 
35 40 45 

TGG AAA ATG TGC GAA AAA CCT TGC GCG GAA TGG GAA AAC GAC AAT TGC 192 
Trp Lys Met Cys Glu Lys Pro Cys Ala Glu Trp Glu Asn Asp Asn Cys 
50 55 60 

TAT TCA GCT GTA CTC TCG CCG GAT AAA AAA ATG CAC GGA TTA CTT GTT 240 
Tyr Ser Ala Val Leu Ser Pro Asp Lys Lys Met His Gly Leu Leu Val 
65 70 75 80 



AAA AAG AAC CAT GAA TAT GAA ATC AAC CAT GTT GAT GTA GCA TTT TCA 
Lys Lys Asn His Glu Tyr Glu lie Asn His Val Asp Val Ala Phe Ser 

85 90 95 



288 



-GCT TTG CAT GGC AAG TCA GGT GAA GAT GGA TCC ATA CAA GGT CTG TTT 
Ala Leu His Gly Lys Ser Gly Glu Asp Gly Ser lie Gin Gly Leu Phe 
100 105 110 



GAA TTG TCC GGT ATC CCT TTT GTA GGC TGC GAT ATT CAA AGC TCA GCA 
Glu Leu Ser Gly lie Pro Phe Val Gly Cys Asp lie Gin Ser Ser Ala 
115 120 125 

ATT TGT ATG GAC AAA TCG TTG ACA TAC ATC GTT GCG AAA AAT GCT GGG 
lie Cys Met Asp Lys Ser Leu Thr Tyr lie Val Ala Lys Asn Ala Gly 
130 135 140 

ATA GCT ACT CCC GCC TTT TGG GTT ATT AAT AAA GAT GAT AGG CCG GTG 
lie Ala Thr Pro Ala Phe Trp Val lie Asn Lys Asp Asp Arg Pro Val 
145 150 155 160 

GCA GCT ACG TTT ACC TAT CCT GTT TTT GTT AAG CCG GCG CGT TCA GGC 
Ala Ala Thr Phe Thr Tyr Pro Val Phe Val Lys Pro Ala Arg Ser Gly 

165 170 175 

TCA TCC TTC GGT GTG AAA AAA GTC AAT AGC GCG GAC GAA TTG GAC TAC 
Ser Ser Phe Gly Val Lys Lys Val Asn Ser Ala Asp Glu Leu Asp Tyr 
180 185 190 

GCA ATT GAA TCG GCA AGA CAA TAT GAC AGC AAA ATC TTA ATT GAG CAG 
Ala lie Glu Ser Ala Arg Gin Tyr Asp Ser Lys lie Leu lie Glu Gin 
195 200 205 

GCT GTT TCG GGC TGT GAG GTC GGT TGT GCG GTA TTG GGA AAC AGT GCC 
Ala Val Ser Gly Cys Glu Val Gly Cys Ala Val Leu Gly Asn Ser Ala 
210 215 220 

GCG TTA GTT GTT GGC GAG GTG GAC CAA ATC AGG CTG CAG TAC GGA ATC 
Ala Leu Val Val Gly Glu Val Asp Gin lie Arg Leu Gin Tyr Gly lie 
225 230 235 240 

TTT CGT ATT CAT CAG GAA GTC GAG CCG GAA AAA GGC TCT GAA AAC GCA 
Phe Arg lie His Gin Glu Val Glu Pro Glu Lys Gly Ser Glu Asn Ala 

245 250 255 

GTT ATA ACC GTT CCC GCA GAC CTT TCA GCA GAG GAG CGA GGA CGG ATA 
Val lie Thr Val Pro Ala Asp Leu Ser Ala Glu Glu Arg Gly Arg lie 
260 265 270 

CAG GAA ACG GCA AAA AAA ATA TAT AAA GCG CTC GGC TGT AGA GGT CTA 
Gin Glu Thr Ala Lys Lys lie Tyr Lys Ala Leu Gly Cys Arg Gly Leu 
275 280 285 

GCC CGT GTG GAT ATG TTT TTA CAA GAT AAC GGC CGC ATT GTA CTG AAC 
Ala Arg Val Asp Met Phe Leu Gin Asp Asn Gly Arg lie Val Leu Asn 
290 295 300 

GAA GTC AAT ACT CTG CCC GGT TTC ACG TCA TAC AGT CGT TAT CCC CGT 
Glu Val Asn Thr Leu Pro Gly Phe Thr Ser Tyr Ser Arg Tyr Pro Arg 
305 310 315 320 



. ATG ATG GCC GCT GCA GGT ATT GCA CTT CCC GAA CTG ATT GAC CGC TTG 
" Met Met Ala Ala Ala Gly lie Ala Leu Pro Glu Leu lie Asp Arg Leu 

325 330 335 



1008 



ATC GTA TTA GCG TTA AAG GGG 1029 
lie Val Leu Ala Leu Lys Gly 
340 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 3 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asn Arg lie Lys Val Ala lie Leu Phe Gly Gly Cys Ser Glu Glu 
15 10 15 

His Asp Val Ser Val Lys Ser Ala lie Glu lie Ala Ala Asn lie Asn 
20 25 30 

Lys Glu Lys Tyr Glu Pro Leu Tyr lie Gly lie Thr Lys Ser Gly Val 
35 40 45 

Trp Lys Met Cys Glu Lys Pro Cys Ala Glu Trp Glu Asn Asp Asn Cys 
50 55 60 

Tyr Ser Ala Val Leu Ser Pro Asp Lys Lys Met His Gly Leu Leu Val 
65 70 75 80 

Lys Lys Asn His Glu Tyr Glu lie Asn His Val Asp Val Ala Phe Ser 

85 90 95 

Ala Leu His Gly Lys Ser Gly Glu Asp Gly Ser lie Gin Gly Leu Phe 
100 105 110 

Glu Leu Ser Gly lie Pro Phe Val Gly Cys Asp lie Gin Ser Ser Ala 
115 120 125 

lie Cys Met Asp Lys Ser Leu Thr Tyr lie Val Ala Lys Asn Ala Gly 
130 135 140 

lie Ala Thr Pro Ala Phe Trp Val lie Asn Lys Asp Asp Arg Pro Val 
145 150 155 160 

Ala Ala Thr Phe Thr Tyr Pro Val Phe Val Lys Pro Ala Arg Ser Gly 

165 170 175 

Ser Ser Phe Gly Val Lys Lys Val Asn Ser Ala Asp Glu Leu Asp Tyr 
180 185 190 

Ala lie Glu Ser Ala Arg Gin Tyr Asp Ser Lys lie Leu lie Glu Gin 
195 200 205 



.Ala Val Ser Gly Cys Glu Val Gly Cys Ala Val Leu Gly Asn Ser Ala 
210 215 220 

Ala Leu Val Val Gly Glu Val Asp Gin lie Arg Leu Gin Tyr Gly lie 
225 230 235 240 

Phe Arg lie His Gin Glu Val Glu Pro Glu Lys Gly Ser Glu Asn Ala 

245 250 255 

Val lie Thr Val Pro Ala Asp Leu Ser Ala Glu Glu Arg Gly Arg lie 
260 265 270 

Gin Glu Thr Ala Lys Lys lie Tyr Lys Ala Leu Gly Cys Arg Gly Leu 
275 280 285 

Ala Arg Val Asp Met Phe Leu Gin Asp Asn Gly Arg lie Val Leu Asn 
290 295 300 

Glu Val Asn Thr Leu Pro Gly Phe Thr Ser Tyr Ser Arg Tyr Pro Arg 
305 310 315 320 

Met Met Ala Ala Ala Gly lie Ala Leu Pro Glu Leu lie Asp Arg Leu 

325 330 335 

lie Val Leu Ala Leu Lys Gly 
340 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..606 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG GAA ATA GGA TTT ACT TTT TTA GAT GAA ATA GTA CAC GGT GTT CGT 
Met Glu lie Gly Phe Thr Phe Leu Asp Glu lie Val His Gly Val Arg 
1 5 10 15 

TGG GAC GCT AAA TAT GCC ACT TGG GAT AAT TTC ACC GGA AAA CCG GTT 
Trp Asp Ala Lys Tyr Ala Thr Trp Asp Asn Phe Thr Gly Lys Pro Val 
20 25 30 



GAC GGT TAT GAA GTA AAT CGC ATT GTA GGG ACA TAC GAG TTG GCT GAA 
Asp Gly Tyr Glu Val Asn Arg lie Val Gly Thr Tyr Glu Leu Ala Glu 
35 40 45 



.TCG CTT TTG AAG GCA AAA GAA CTG GCT GCT ACC CAA GGG TAC GGA TTG 192 
Ser Leu Leu Lys Ala Lys Glu Leu Ala Ala Thr Gin Gly Tyr Gly Leu 
50 55 60 

CTT CTA TGG GAC GGT TAC CGT CCT AAG CGT GCT GTA AAC TGT TTT ATG 240 
Leu Leu Trp Asp Gly Tyr Arg Pro Lys Arg Ala Val Asn Cys Phe Met 
65 70 75 80 

CAA TGG GCT GCA CAG CCG GAA AAT AAC CTG ACA AAG GAA AGT TAT TAT 2 88 

Gin Trp Ala Ala Gin Pro Glu Asn Asn Leu Thr Lys Glu Ser Tyr Tyr 

85 90 95 

CCC AAT ATT GAC CGA ACT GAG ATG ATT TCA AAA GGA TAC GTG GCT TCA 33 6 

Pro Asn lie Asp Arg Thr Glu Met lie Ser Lys Gly Tyr Val Ala Ser 
100 105 110 

AAA TCA AGC CAT AGC CGC GGC AGT GCC ATT GAT CTT ACG CTT TAT CGA 384 
Lys Ser Ser His Ser Arg Gly Ser Ala lie Asp Leu Thr Leu Tyr Arg 
115 120 125 

TTA GAC ACG GGT GAG CTT GTA CCA ATG GGG AGC CGA TTT GAT TTT ATG 4 32 

Leu Asp Thr Gly Glu Leu Val Pro Met Gly Ser Arg Phe Asp Phe Met 
130 135 140 

GAT GAA CGC TCT CAT CAT GCG GCA AAT GGA ATA TCA TGC AAT GAA GCG 48 0 

Asp Glu Arg Ser His His Ala Ala Asn Gly lie Ser Cys Asn Glu Ala 
145 150 155 160 

CAA AAT CGC AGA CGT TTG CGC TCC ATC ATG GAA AAC AGT GGG TTT GAA 528 
Gin Asn Arg Arg Arg Leu Arg Ser lie Met Glu Asn Ser Gly Phe Glu 

165 170 175 

GCA TAT AGC CTC GAA TGG TGG CAC TAT GTA TTA AGA GAC GAA CCA TAC 57 6 

Ala Tyr Ser Leu Glu Trp Trp His Tyr Val Leu Arg Asp Glu Pro Tyr 
180 185 190 

CCC AAT AGC TAT TTT GAT TTC CCC GTT AAA 60 6 

Pro Asn Ser Tyr Phe Asp Phe Pro Val Lys 
195 200 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 02 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Glu lie Gly Phe Thr Phe Leu Asp Glu lie Val His Gly Val Arg 
15 10 15 



Trp Asp Ala Lys Tyr Ala Thr Trp Asp Asn Phe Thr Gly Lys Pro Val 
20 25 30 



Asp Gly Tyr Glu Val Asn Arg lie Val Gly Thr Tyr Glu Leu Ala Glu 
35 40 45 

Ser Leu Leu Lys Ala Lys Glu Leu Ala Ala Thr Gin Gly Tyr Gly Leu 
50 55 60 

Leu Leu Trp Asp Gly Tyr Arg Pro Lys Arg Ala Val Asn Cys Phe Met 
65 70 75 80 

Gin Trp Ala Ala Gin Pro Glu Asn Asn Leu Thr Lys Glu Ser Tyr Tyr 

85 90 95 

Pro Asn lie Asp Arg Thr Glu Met lie Ser Lys Gly Tyr Val Ala Ser 
100 105 110 

Lys Ser Ser His Ser Arg Gly Ser Ala lie Asp Leu Thr Leu Tyr Arg 
115 120 125 

Leu Asp Thr Gly Glu Leu Val Pro Met Gly Ser Arg Phe Asp Phe Met 
130 135 140 

Asp Glu Arg Ser His His Ala Ala Asn Gly lie Ser Cys Asn Glu Ala 
145 150 155 160 

Gin Asn Arg Arg Arg Leu Arg Ser lie Met Glu Asn Ser Gly Phe Glu 

165 170 175 

Ala Tyr Ser Leu Glu Trp Trp His Tyr Val Leu Arg Asp Glu Pro Tyr 
180 185 190 

Pro Asn Ser Tyr Phe Asp Phe Pro Val Lys 
195 200 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 215 ..1243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GAGCTCTTCC TTCAACGCAC TTCTGTACCA AGAGTTGTTG TCCATTTGAT CACTAACAAT 
AGCTTCCCCT GCTTTCTTCA AGCCCTTTGT CATAAAATCG TTAGATTTTC ATCATAAAAA 
TACGAGAAAG ACAACAGGAA GACCGCAAAT TTTCTTTTCT TTTCCTAGGT ACACTGAATG 



TAACCTTAAA AGAAAAAAGG AAAGGAAGAA AATG ATG AAA AAA ATT GCC GTT 2 32 

Met Lys Lys lie Ala Val 
1 5 

TTA TTT GGA GGG AAT TCT CCA GAA TAC TCA GTG TCA CTA ACC TCA GCA 2 80 

Leu Phe Gly Gly Asn Ser Pro Glu Tyr Ser Val Ser Leu Thr Ser Ala 
10 15 20 

GCA AGT GTG ATC CAA GCT ATT GAC CCG CTG AAA TAT GAA GTA ATG ACC 3 28 

Ala Ser Val lie Gin Ala lie Asp Pro Leu Lys Tyr Glu Val Met Thr 
25 30 35 

ATT GGC ATC GCA CCA ACA ATG GAT TGG TAT TGG TAT CAA GGA AAC CTC 37 6 

lie Gly lie Ala Pro Thr Met Asp Trp Tyr Trp Tyr Gin Gly Asn Leu 
40 45 50 

GCG AAT GTT CGC AAT GAT ACT TGG CTA GAA GAT CAC AAA AAC TGT CAC 4 24 

Ala Asn Val Arg Asn Asp Thr Trp Leu Glu Asp His Lys Asn Cys His 
55 60 65 70 

CAG CTG ACT TTT TCT AGC CAA GGA TTT ATA TTA GGA GAA AAA CGA ATC 472 
Gin Leu Thr Phe Ser Ser Gin Gly Phe lie Leu Gly Glu Lys Arg lie 

75 80 85 

GTC CCT GAT GTC CTC TTT CCA GTC TTG CAT GGG AAG TAT GGC GAG GAT 52 0 

Val Pro Asp Val Leu Phe Pro Val Leu His Gly Lys Tyr Gly Glu Asp 
90 95 100 

GGC TGT ATC CAA GGA CTG CTT GAA CTA ATG AAC CTG CCT TAT GTT GGT 568 
Gly Cys lie Gin Gly Leu Leu Glu Leu Met Asn Leu Pro Tyr Val Gly 
105 110 115 

TGC CAT GTC GCT GCC TCC GCA TTA TGT ATG AAC AAA TGG CTC TTG CAT 616 
Cys His Val Ala Ala Ser Ala Leu Cys Met Asn Lys Trp Leu Leu His 
120 125 130 

CAA CTT GCT GAT ACC ATG GGA ATC GCT AGT GCT CCC ACT TTG CTT TTA 664 
Gin Leu Ala Asp Thr Met Gly lie Ala Ser Ala Pro Thr Leu Leu Leu 
135 140 145 150 

TCC CGC TAT GAA AAC GAT CCT GCC ACA ATC GAT CGT TTT ATT CAA GAC 712 
Ser Arg Tyr Glu Asn Asp Pro Ala Thr lie Asp Arg Phe lie Gin Asp 

155 160 165 

CAT GGA TTC CCG ATC TTT ATC AAG CCG AAT GAA GCC GGT TCT TCA AAA 7 60 

His Gly Phe Pro lie Phe lie Lys Pro Asn Glu Ala Gly Ser Ser Lys 
170 175 180 

GGG ATC ACA AAA GTA ACT GAC AAA ACA GCG CTC CAA TCT GCA TTA ACG 808 
Gly lie Thr Lys Val Thr Asp Lys Thr Ala Leu Gin Ser Ala Leu Thr 
185 190 195 

ACT GCT TTT GCT TAC GGT TCT ACT GTG TTG ATC CAA AAG GCG ATA GCG 856 
Thr Ala Phe Ala Tyr Gly Ser Thr Val Leu lie Gin Lys Ala lie Ala 
200 205 210 



.GGT ATT GAA ATT GGC TGC GGC ATC TTA GGA AAT GAG CAA TTG ACG ATT 9 04 

Gly lie Glu lie Gly Cys Gly lie Leu Gly Asn Glu Gin Leu Thr lie 
215 220 225 230 

GGT GCT TGT GAT GCG ATT TCT CTT GTC GAC GGT TTT TTT GAT TTT GAA 9 52 

Gly Ala Cys Asp Ala lie Ser Leu Val Asp Gly Phe Phe Asp Phe Glu 

235 240 245 

GAG AAA TAG CAA TTA ATC AGC GCC ACG ATC ACT GTC CCA GCA CCA TTG 1000 
Glu Lys Tyr Gin Leu lie Ser Ala Thr lie Thr Val Pro Ala Pro Leu 
250 255 260 

CCT CTC GCG CTT GAA TCA CAG ATC AAG GAG CAG GCA CAG CTG CTT TAT 1048 
Pro Leu Ala Leu Glu Ser Gin lie Lys Glu Gin Ala Gin Leu Leu Tyr 
265 270 275 

CGA AAC TTG GGA TTG ACG GGT CTG GCT CGA ATC GAT TTT TTC GTC ACC 109 6 

Arg Asn Leu Gly Leu Thr Gly Leu Ala Arg lie Asp Phe Phe Val Thr 
280 285 290 

AAT CAA GGA GCG ATT TAT TTA AAC GAA ATC AAC ACC ATG CCG GGA TTT 114 4 

Asn Gin Gly Ala lie Tyr Leu Asn Glu lie Asn Thr Met Pro Gly Phe 
295 300 305 310 

ACT GGG CAC TCC CGC TAC CCA GCT ATG ATG GCG GAA GTC GGG TTA TCC 1192 
Thr Gly His Ser Arg Tyr Pro Ala Met Met Ala Glu Val Gly Leu Ser 

315 320 325 

TAC GAA ATA TTA GTA GAG CAA TTG ATT GCA CTG GCA GAG GAG GAC AAA 124 0 

Tyr Glu lie Leu Val Glu Gin Leu lie Ala Leu Ala Glu Glu Asp Lys 
330 335 340 

CGA TGAACACATT ACAATTGATC AATAAAAACC ATCCATTGAA AAAAAATCAA 1293 
Arg 

GAGCCCCCGC ACTTAGTGCT AGCTCCTTTT AGCGATCACG ATGTTTACCT GCAG 1347 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Lys Lys lie Ala Val Leu Phe Gly Gly Asn Ser Pro Glu Tyr Ser 
1 5 10 15 

Val Ser Leu Thr Ser Ala Ala Ser Val lie Gin Ala lie Asp Pro Leu 
20 25 30 



Lys Tyr Glu Val Met Thr lie Gly lie Ala Pro Thr Met Asp Trp Tyr 
35 40 45 



.Trp Tyr Gin Gly Asn Leu Ala Asn Val Arg Asn Asp Thr Trp Leu Glu 
50 55 60 

Asp His Lys Asn Cys His Gin Leu Thr Phe Ser Ser Gin Gly Phe lie 
65 70 75 80 

Leu Gly Glu Lys Arg lie Val Pro Asp Val Leu Phe Pro Val Leu His 

85 90 95 

Gly Lys Tyr Gly Glu Asp Gly Cys lie Gin Gly Leu Leu Glu Leu Met 
100 105 110 

Asn Leu Pro Tyr Val Gly Cys His Val Ala Ala Ser Ala Leu Cys Met 
115 120 125 

Asn Lys Trp Leu Leu His Gin Leu Ala Asp Thr Met Gly lie Ala Ser 
130 135 140 

Ala Pro Thr Leu Leu Leu Ser Arg Tyr Glu Asn Asp Pro Ala Thr lie 
145 150 155 160 

Asp Arg Phe lie Gin Asp His Gly Phe Pro lie Phe lie Lys Pro Asn 

165 170 175 

Glu Ala Gly Ser Ser Lys Gly lie Thr Lys Val Thr Asp Lys Thr Ala 
180 185 190 

Leu Gin Ser Ala Leu Thr Thr Ala Phe Ala Tyr Gly Ser Thr Val Leu 
195 200 205 

lie Gin Lys Ala lie Ala Gly lie Glu lie Gly Cys Gly lie Leu Gly 
210 215 220 

Asn Glu Gin Leu Thr lie Gly Ala Cys Asp Ala lie Ser Leu Val Asp 
225 230 235 240 

Gly Phe Phe Asp Phe Glu Glu Lys Tyr Gin Leu lie Ser Ala Thr lie 

245 250 255 

Thr Val Pro Ala Pro Leu Pro Leu Ala Leu Glu Ser Gin lie Lys Glu 
260 265 270 

Gin Ala Gin Leu Leu Tyr Arg Asn Leu Gly Leu Thr Gly Leu Ala Arg 
275 280 285 

lie Asp Phe Phe Val Thr Asn Gin Gly Ala lie Tyr Leu Asn Glu lie 
290 295 300 

Asn Thr Met Pro Gly Phe Thr Gly His Ser Arg Tyr Pro Ala Met Met 
305 310 315 320 

Ala Glu Val Gly Leu Ser Tyr Glu lie Leu Val Glu Gin Leu lie Ala 

325 330 335 

Leu Ala Glu Glu Asp Lys Arg 
340 



-(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GGNGARGAYG GNWSNHTNCA RGGN 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AAYACNHTNC CNGGNTTYAC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..693 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG AGC GAT AAA ATA CTT ATT GTG GAT GAT GAA CAT GAA ATT GCC GAT 
Met Ser Asp Lys lie Leu lie Val Asp Asp Glu His Glu lie Ala Asp 
15 10 15 

TTG GTT GAA TTA TAC TTA AAA AAC GAG AAT TAT ACG GTT TTC AAA TAC 
Leu Val Glu Leu Tyr Leu Lys Asn Glu Asn Tyr Thr Val Phe Lys Tyr 
20 25 30 



TAT ACC GCC AAA GAA GCA TTG GAA TGT ATA GAC AAG TCT GAG ATT GAC 14 4 

Tyr Thr Ala Lys Glu Ala Leu Glu Cys lie Asp Lys Ser Glu lie Asp 
35 40 45 

CTT GCC ATA TTG GAC ATC ATG CTT CCC GGC ACA AGC GGC CTT ACT ATC 192 
Leu Ala lie Leu Asp lie Met Leu Pro Gly Thr Ser Gly Leu Thr lie 
50 55 60 

TGT CAA AAA ATA AGG GAC AAG CAC ACC TAT CCG ATT ATC ATG CTG ACC 24 0 

Cys Gin Lys lie Arg Asp Lys His Thr Tyr Pro lie lie Met Leu Thr 
65 70 75 80 

GGG AAA GAT ACA GAG GTA GAT AAA ATT ACA GGG TTA ACA ATC GGC GCG 2 88 

Gly Lys Asp Thr Glu Val Asp Lys lie Thr Gly Leu Thr lie Gly Ala 

85 90 95 

GAT GAT TAT ATA ACG AAG CCC TTT CGC CCA CTG GAG TTA ATT GCT CGG 3 36 

Asp Asp Tyr lie Thr Lys Pro Phe Arg Pro Leu Glu Leu lie Ala Arg 
100 105 110 

GTA AAG GCC CAG TTG CGC CGA TAC AAA AAA TTC AGT GGA GTA AAG GAG 3 84 

Val Lys Ala Gin Leu Arg Arg Tyr Lys Lys Phe Ser Gly Val Lys Glu 
115 120 125 

CAG AAC GAA AAT GTT ATC GTC CAC TCC GGC CTT GTC ATT AAT GTT AAC 4 32 

Gin Asn Glu Asn Val lie Val His Ser Gly Leu Val lie Asn Val Asn 
130 135 140 

ACC CAT GAG TGT TAT CTG AAC GAG AAG CAG TTA TCC CTT ACT CCC ACC 480 
Thr His Glu Cys Tyr Leu Asn Glu Lys Gin Leu Ser Leu Thr Pro Thr 
145 150 155 160 

GAG TTT TCA ATA CTG CGA ATC CTC TGT GAA AAC AAG GGG AAT GTG GTT 52 8 

Glu Phe Ser lie Leu Arg lie Leu Cys Glu Asn Lys Gly Asn Val Val 

165 170 175 

AGC TCC GAG CTG CTA TTT CAT GAG ATA TGG GGC GAC GAA TAT TTC AGC 576 
Ser Ser Glu Leu Leu Phe His Glu lie Trp Gly Asp Glu Tyr Phe Ser 
180 185 190 

AAG AGC AAC AAC ACC ATC ACC GTG~ CAT ATC CGG CAT TTG CGC GAA AAA 624 
Lys Ser Asn Asn Thr lie Thr Val His lie Arg His Leu Arg Glu Lys 
195 200 205 

ATG AAC GAC ACC ATT GAT AAT CCG AAA TAT ATA AAA ACG GTA TGG GGG 672 
Met Asn Asp Thr lie Asp Asn Pro Lys Tyr lie Lys Thr Val Trp Gly 
210 215 220 

GTT GGT TAT AAA ATT GAA AAA 69 3 

Val Gly Tyr Lys lie Glu Lys 
225 230 . 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Asp Lys lie Leu lie Val Asp Asp Glu His Glu lie Ala Asp 
15 10 15 

Leu Val Glu Leu Tyr Leu Lys Asn Glu Asn Tyr Thr Val Phe Lys Tyr 
20 25 30 

Tyr Thr Ala Lys Glu Ala Leu Glu Cys lie Asp Lys Ser Glu lie Asp 
35 40 45 

Leu Ala lie Leu Asp lie Met Leu Pro Gly Thr Ser Gly Leu Thr lie 
50 55 60 

Cys Gin Lys lie Arg Asp Lys His Thr Tyr Pro lie lie Met Leu Thr 
65 70 75 80 

Gly Lys Asp Thr Glu Val Asp Lys lie Thr Gly Leu Thr lie Gly Ala 

85 90 95 * . 

Asp Asp Tyr lie Thr Lys Pro Phe Arg Pro Leu Glu Leu lie Ala Arg 
100 105 110 

Val Lys Ala Gin Leu Arg Arg Tyr Lys Lys Phe Ser Gly Val Lys Glu 
115 120 125 

Gin Asn Glu Asn Val lie Val His Ser Gly Leu Val lie Asn Val Asn 
130 135 140 

Thr His Glu Cys Tyr Leu Asn Glu Lys Gin Leu Ser Leu Thr Pro Thr 
145 150 155 160 

Glu Phe Ser lie Leu Arg lie Leu Cys Glu Asn Lys Gly Asn Val Val 

165 170 175 

Ser Ser Glu Leu Leu Phe His Glu lie Trp Gly Asp Glu Tyr Phe Ser 
180 185 190 

Lys Ser Asn Asn Thr lie Thr Val His lie Arg His Leu Arg Glu Lys 
195 200 205 

Met Asn Asp Thr lie Asp Asn Pro Lys Tyr lie Lys Thr Val Trp Gly 
210 215 220 

Val Gly Tyr Lys lie Glu Lys 
225 230 



.(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1152 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TTG GTT ATA AAA TTG AAA AAT AAA AAA AAC GAC TAT TCC AAA CTA GAA 
Leu Val lie Lys Leu Lys Asn Lys Lys Asn Asp Tyr Ser Lys Leu Glu 
15 10 15 

CGA AAA CTT TAC ATG TAT ATC GTT GCA ATT GTT GTG GTA GCA ATT GTA 
Arg Lys Leu Tyr Met Tyr lie Val Ala lie Val Val Val Ala lie Val 
20 25 30 

TTC GTG TTG TAT ATT CGT TCA ATG ATC CGA GGG AAA CTT GGG GAT TGG 
Phe Val Leu Tyr lie Arg Ser Met lie Arg Gly Lys Leu Gly Asp Trp 
35 40 45 

ATC TTA AGT ATT TTG GAA AAC AAA TAT GAC TTA AAT CAC CTG GAC GCG 
lie Leu Ser lie Leu Glu Asn Lys Tyr Asp Leu Asn His Leu Asp Ala 
50 55 60 

ATG AAA TTA TAT CAA TAT TCC ATA CGG AAC AAT ATA GAT ATC TTT ATT 
Met Lys Leu Tyr Gin Tyr Ser lie Arg Asn Asn lie Asp lie Phe lie 
65 70 75 80 

TAT GTG GCG ATT GTC ATT AGT ATT CTT ATT CTA TGT CGC GTC ATG CTT 
Tyr Val Ala lie Val lie Ser lie Leu lie Leu Cys Arg Val Met Leu 

85 90 95 

TCA AAA TTC GCA AAA TAC TTT GAC GAG ATA AAT ACC GGC ATT GAT GTA 
Ser Lys Phe Ala Lys Tyr Phe Asp Glu lie Asn Thr Gly lie Asp Val 
100 105 110 

CTT ATT CAG AAC GAA GAT AAA CAA ATT GAG CTT TCT GCG GAA ATG GAT 
Leu lie Gin Asn Glu Asp Lys Gin lie Glu Leu Ser Ala Glu Met Asp 
115 120 125 

GTT ATG GAA CAA AAG CTC AAC ACA TTA AAA CGG ACT CTG GAA AAG CGA 
Val Met Glu Gin Lys Leu Asn Thr Leu Lys Arg Thr Leu Glu Lys Arg 
130 135 140 

GAG CAG GAT GCA AAG CTG GCC GAA CAA AGA AAA AAT GAC GTT GTT ATG 
Glu Gin Asp Ala Lys Leu Ala Glu Gin Arg Lys Asn Asp Val Val Met 
145 150 155 160 



TAG TTG GCG CAC GAT ATT AAA ACG CCC CTT ACA TCC ATT ATC GGT TAT 
Tyr Leu Ala His Asp lie Lys Thr Pro Leu Thr Ser lie lie Gly Tyr 

165 170 175 

TTG AGC CTG CTT GAC GAG GCT CCA GAC ATG CCG GTA GAT CAA AAG GCA 
Leu Ser Leu Leu Asp Glu Ala Pro Asp Met Pro Val Asp Gin Lys Ala 
180 185 190 

AAG TAT GTG CAT ATC ACG TTG GAC AAA GCG TAT CGA CTC GAA CAG CTA 
Lys Tyr Val His lie Thr Leu Asp Lys Ala Tyr Arg Leu Glu Gin Leu 
195 200 205 

ATC GAC GAG TTT TTT GAG ATT ACA CGG TAT AAC CTA CAA ACG ATA ACG 
lie Asp Glu Phe Phe Glu lie Thr Arg Tyr Asn Leu Gin Thr lie Thr 
210 215 220 

CTA ACA AAA ACG CAC ATA GAC CTA TAC TAT ATG CTG GTG CAG ATG ACC 
Leu Thr Lys Thr His lie Asp Leu Tyr Tyr Met Leu Val Gin Met Thr 
225 230 235 240 

GAT GAA TTT TAT CCT CAG CTT TCC GCA CAT GGA AAA CAG GCG GTT ATT 
Asp Glu Phe Tyr Pro Gin Leu Ser Ala His Gly Lys Gin Ala Val lie 

245 250 255 

CAC GCC CCC GAG GAT CTG ACC GTG TCC GGC GAC CCT GAT AAA CTC GCG 
His Ala Pro Glu Asp Leu Thr Val Ser Gly Asp Pro Asp Lys Leu Ala 
260 265 270 

AGA GTC TTT AAC AAC ATT TTG AAA AAC GCC GCT GCA TAC AGT GAG GAT 
Arg Val Phe Asn Asn lie Leu Lys Asn Ala Ala Ala Tyr Ser Glu Asp 
275 280 285 

AAC AGC ATC ATT GAC ATT ACC GCG GGC CTC TCC GGG GAT GTG GTG TCA 
Asn Ser lie lie Asp lie Thr Ala Gly Leu Ser Gly Asp Val Val Ser 
290 295 300 

ATC GAA TTC AAG AAC ACT GGA AGC ATC CCA AAA GAT AAG CTA GCT GCC 
lie Glu Phe Lys Asn Thr Gly Ser lie Pro Lys Asp Lys Leu Ala Ala 
305 310 315 320 

ATA TTT GAA AAG TTC TAT AGG CTG GAC AAT GCT CGT TCT TCC GAT ACG 
He Phe Glu Lys Phe Tyr Arg Leu Asp Asn Ala Arg Ser Ser Asp Thr 

325 330 335 

GGT GGC GCG GGA CTT GGA TTG GCG ATT GCA AAA GAA ATT ATT GTT CAG 
Gly Gly Ala Gly- Leu Gly Leu Ala lie Ala Lys Glu lie lie Val Gin 
340 345 350 

CAT GGA GGG CAG ATT TAC GCG GAA AGC AAT GAT AAC TAT ACG ACG TTT 
His Gly Gly Gin lie Tyr Ala Glu Ser Asn Asp Asn Tyr Thr Thr Phe 
355 360 365 



AGG GTA GAG CTT CCA GCG ATG CCA GAC TTG GTT GAT AAA AGG AGG TCC 
Arg Val Glu Leu Pro Ala Met Pro Asp Leu Val Asp Lys Arg Arg Ser 
370 375 380 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu Val lie Lys Leu Lys Asn Lys Lys Asn Asp Tyr Ser Lys Leu Glu 
1 5 10 15 

Arg Lys Leu Tyr Met Tyr lie Val Ala lie Val Val Val Ala lie Val 
20 25 30 

Phe Val Leu Tyr lie Arg Ser Met lie Arg Gly Lys Leu Gly Asp Trp 
35 40 45 

lie Leu Ser lie Leu Glu Asn Lys Tyr Asp Leu Asn His Leu Asp Ala 
50 55 60 

Met Lys Leu Tyr Gin Tyr Ser lie Arg Asn Asn lie Asp lie Phe lie 
65 70 75 80 

Tyr Val Ala lie Val lie Ser lie Leu lie Leu Cys Arg Val Met Leu 

85 90 95 

Ser Lys Phe Ala Lys Tyr Phe Asp Glu lie Asn Thr Gly lie Asp Val 
100 105 110 

Leu lie Gin Asn Glu Asp Lys Gin lie Glu Leu Ser Ala Glu Met Asp 
115 120 125 

Val Met Glu Gin Lys Leu Asn Thr Leu Lys Arg Thr Leu Glu Lys Arg 
130 135 140 

Glu Gin Asp Ala Lys Leu Ala Glu Gin Arg Lys Asn Asp Val Val Met 
145 150 155 160 

Tyr Leu Ala His Asp lie Lys Thr Pro Leu Thr Ser lie lie Gly Tyr 

165 170 175 

Leu Ser Leu Leu Asp Glu Ala Pro Asp Met Pro Val Asp Gin Lys Ala 
180 185 190 

Lys Tyr Val His lie Thr Leu Asp Lys Ala Tyr Arg Leu Glu Gin Leu 
195 200 205 

lie Asp Glu Phe Phe Glu lie Thr Arg Tyr Asn Leu Gin Thr lie Thr 
210 215 220 

Leu Thr Lys Thr His lie Asp Leu Tyr Tyr Met Leu Val Gin Met Thr 
225 230 235 240 

Asp Glu Phe Tyr Pro Gin Leu Ser Ala His Gly Lys Gin Ala Val lie 

245 250 255 



-His Ala Pro Glu Asp Leu Thr Val Ser Gly Asp Pro Asp Lys Leu Ala 
260 265 270 

Arg Val Phe Asn Asn lie Leu Lys Asn Ala Ala Ala Tyr Ser Glu Asp 
275 280 285 

Asn Ser lie lie Asp lie Thr Ala Gly Leu Ser Gly Asp Val Val Ser 
290 295 300 

lie Glu Phe Lys Asn Thr Gly Ser lie Pro Lys Asp Lys Leu Ala Ala 
305 310 315 320 

lie Phe Glu Lys Phe Tyr Arg Leu Asp Asn Ala Arg Ser Ser Asp Thr 

325 330 335 

Gly Gly Ala Gly Leu Gly Leu Ala lie Ala Lys Glu lie lie Val Gin 
340 345 350 

His Gly Gly Gin lie Tyr Ala Glu Ser Asn Asp Asn Tyr Thr Thr Phe 
355 360 365 

Arg Val Glu Leu Pro Ala Met Pro Asp Leu Val Asp Lys Arg Arg Ser 
370 375 380 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AAGCTTTTCT TTTTGCTCAT TTGTTAGAGA TTTACTAACC GTATTAAATA GCTTCTTTTC 60 

AGCCATTGCC CTTGCTTCCC ACACCATTCT TTCAAGTGTA GTGATAGCAG GCAGTATAAT 12 0 

TTTGTTTTTT CTTAGAAAAT CTATGCATTC ATGCAGTAGA TGAATGGCAT CACCATTTTC 180 

CAAAGCTAAT TGATGAAGGT ACTTAAATGT CATTCGATAT TCACTCAGGG TAAAAGTTAC 240 

AAAGTCGTAT TCACTTCGAA TTTCTTTCAA ATGATCCCAA AGTGTATTTT CCCTTTGAGG 3 00 

ATAATGATCA AGCGAGGATG GACTAACACC AATCTGTTTC GATATATATT GTATGACCGA 3 60 

ATCTGGGATG CTTTTGATAT GAGTGTATGG CCAACCGGGA TACCGAAGAA CAGCTAATTG 420 

AACAGCAAAT CCTAAACGGT TTTCTTCCCT CCTTCGCTTA TTAACTATTT CTAAATCCCG 4 80 

TTTGGAAAAA GTGAAGTAGG TCCCCAGTAT CCATTCATCT TCAGGGATTT GCATAAAAGC 54 0 

CTGTCTCTGT TCCGGTGTAA GCAATTCTCT ACCTCTCGCA ATTTTCATTC AGTATCATTC 600 



CATTTCTGTA TTTTCAATTT ATTAGTTCAA TTATATATCA ATAGAGTGTA CTCTATTGAT 66 0 

ACAAATGTAG TAGACTGATA AAATCATAGT TAAGAGCGTC TCATAAGACT TGTCTCAAAA 72 0 

ATGAGGTGAT ATTTTGCGGA AAATCGGTTA TATTCGTGTC AGTTCGACTA ACCAGAATCC 78 0 

TTCAAGACAA TTTCAGCAGT TGAACGAGAT CGGAATGGAT ATTATATAAA GAGAAAGTTT 84 0 

CAGGAGCAAC AAAGGATCGC GAGCAACTTC AAAAAGTGTT AGACGATTTA CAGGAAGATG 900 

ACATCATTTA TGTTACAGAC TTAACTCGAA TCACTCGTAG TACACAAGAT CTATTTGAAT 9 60 

TAATCGATAA CATACGAGAT AAAAAGGCAA GTTTAAAATC ACTAAAAGAT ACATGGCTTG 102 0 

ATTTATCAGA AGATAATCCA TACAGCCAAT TCTTAATTAC TGTAATGGCT GGTGTTAACC 108 0 

- AATT AG AG CG AGATCTTATT CGGATGAGAC AACGTGAAGG GATTGAATTG GCTAAGAAAG 114 0 

AAGGAAAGTT TAAAGGTCGA TTAAAGAAGT AT C AT AAAAA TCACGCAGGA ATGAATTATG 12 00 

CGGAAAGCTA TATAAAGAAG GAAATATGAC TGTAAATCAA ATTTGTGAAA TTACTAATGT 12 60 

ATCTAGGGCT TCATTATACA GGAAATTATC AGAAGTGAAT AATTAGCCAT TCTGTATTCC 13 2 0 

GCTAATGGGC AATATTTTTA AAGAAGAAAA GGAAACTATA AAATATTAAC AGCCTCCTAG 138 0 

CGATGCCGAA AAGCCCTTTG ATAAAAAAAG AATCATCATC TTAAGAAATT CTTAGTCATT 144 0 

TATTATGTAA ATGCTTATAA ATTCGGCCCT ATAATCTGAT AAATTATTAA GGGCAAACTT 1500 

ATGTGAAAGG GTGATAACTA TGAGCGATAA AATACTTATT GTGGATGATG AACATGAAAT 1560 

TGCCGATTTG GTTGAATTAT ACTTAAAAAA CGAGAATTAT ACGGTTTTCA AATACTATAC 162 0 

CGCCAAAGAA GCATTGGAAT GTATAGACAA GT C TG AG ATT GACCTTGCCA TATTGGACAT 1680 

CATGCTTCCC GGCACAAGCG GCCTTACTAT CTGTCAAAAA ATAAGGGACA AGCACACCTA 174 0 

TCCGATTATC ATGCTGACCG GGAAAGATAC AGAGGTAGAT AAAATTACAG GGTTAACAAT 1800 

CGGCGCGGAT GATTATATAA CGAAGCCCTT TCGCCCACTG GAGTTAATTG CTCGGGTAAA 18 60 

GGCCCAGTTG CGCCGATACA AAAAATTCAG TGGAGTAAAG GAGCAGAACG AAAATGTTAT 19 2 0 

CGTCCACTCC GGCCTTGTCA TTAATGTTAA CACCCATGAG TGTTATCTGA ACGAGAAGCA 1980 

GTTATCCCTT ACTCCCACCG AGTTTTCAAT ACTGCGAATC CTCTGTGAAA ACAAGGGGAA 2 04 0 

TGTGGTTAGC TCCGAGCTGC TATTTCATGA GATATGGGGC GACGAATATT TCAGCAAGAG 2100 

CAACAACACC ATCACCGTGC ATATCCGGCA TTTGCGCGAA AAAATGAACG ACACCATTGA 2160 

TAATCCGAAA TATATAAAAA CGGTATGGGG GGTTGGTTAT AAAATTGAAA AATAAAAAAA 22 20 

ACGACTATTC CAAACTAGAA CGAAAACTTT ACATGTATAT CGTTGCAATT GTTGTGGTAG 2 2 80 

CAATTGTATT CGTGTTGTAT ATTCGTTCAA TGATCCGAGG GAAACTTGGG GATTGGATCT 23 4 0 



^ TAAGTATTTT GGAAAACAAA TATGACTTAA ATCACCTGGA CGCGATGAAA TTATATCAAT 2 4 00 

ATTCCATACG GAACAATATA GATATCTTTA TTTATGTGGC GATTGTCATT AGTATTCTTA 2 4 60 

TTCTATGTCG CGTCATGCTT TCAAAATTCG CAAAATACTT TGACGAGATA AATACCGGCA 2 52 0 

TTGATGTACT TATTCAGAAC GAAGATAAAC AAATTG AG CT TTCTGCGGAA ATGGATGTTA 2 58 0 

TGGAACAAAA GCTCAACACA TTAAAACGGA CTCTGGAAAA GCGAGAGCAG GATGCAAAGC 2 64 0 

TGGCCGAACA AAGAAAAAAT GACGTTGTTA TGTACTTGGC GCACGATATT AAAACGCCCC 2 7 00 

TTACATCCAT TATCGGTTAT TTGAGCCTGC TTGACGAGGC TCCAGACATG CCGGTAGATC 2760 

AAAAGGCAAA GTATGTGCAT ATCACGTTGG AC AAAG CGT A TCGACTCGAA CAGCTAATCG 2 82 0 

- ACGAGTTTTT TGAGATTACA CGGTATAACC TACAAACGAT AACGCTAACA AAAACGCACA 2880 

TAGACCTATA CTATATGCTG GTGCAGATGA CCGATGAATT TTATCCTCAG CTTTCCGCAC 29 40 

ATGGAAAACA GGCGGTTATT CACGCCCCCG AGGATCTGAC CGTGTCCGGC GACCCTGATA 3 000 

AACTCGCGAG AGTCTTTAAC AACATTTTGA AAAACGCCGC TGCATACAGT GAGGATAACA 3 060 

GCATCATTGA CATTACCGCG GGCCTCTCCG GGGATGTGGT GTCAATCGAA TTCAAGAACA 312 0 

CTGGAAGCAT CCCAAAAGAT AAGCTAGCTG CCATATTTGA AAAGTTCTAT AGGCTGGACA 318 0 

ATTCTCGTTC TTCCGATACG GGTGGCGCGG GACTTGGATT GGCGATTGCA AAAGAAATTA 3 24 0 

TTGTTCAGCA TGGAGGGCAG ATTTACGCGG AAAGCTATGA TAACTATACG ACGTTTAGGG 3 3 00 

TAGAGCTTCC AGCGATGCCA GACTTGGTTG ATAAAAGGAG GTCCTAAGAG ATGTATATAA 3 3 60 

TTTTTTAGGA AAATCTCAAG GTTATCTTTA CTTTTTCTTA GGAAATTAAC AATTTAATAT 34 20 

TAAGAAACGG CTCGTTCTTA CACGGTAGAC TTAATACCGT AAGAACGAGC CGTTTTCGTT 34 8 0 

CTTCAGAGAA AGATTTGACA AGATTACCAT TGGCATCCCC GTTTTATTTG GTGCCTTTCA 3 54 0 

CAGAAAGGGT TGGTCTTAAT TATGAATAAC ATCGGCATTA CTGTTTATGG ATGTGAGCAG 3 600 

GATGAGGCAG ATGCATTCCA TGCTCTTTCG CCTCGCTTTG GCGTTATGGC AACGATAATT 3 660 

AACGCCAACG TGTCGGAATC CAACGCCAAA TCCGCGCCTT TCAATCAATG TATCAGTGTG 3 72 0 

GGACATAAAT C AG AG ATTT C CGCCTCTATT CTTCTTGCGC TGAAGAGAGC CGGTGTGAAA 3 780 

TATATTTCTA CCCGAAGCAT CGGCTGCAAT CATATAGATA CAACTGCTGC TAAGAGAATG 3 84 0 

GGCATCACTG TCGACAATGT GGCGTACTCG CCGGATAGCG TTGCCGATTA TACTATGATG 3 9 00 

CTAATTCTTA TGGCAGTACG CAACGTAAAA TCGATTGTGC GCTCTGTGGA AAAACATGAT 3 9 60 

TTCAGGTTGG ACAGCGACCG TGGCAAGGTA CTCAGCGACA TGACAGTTGG TGTGGTGGGA 4 02 0 

ACGGGCCAGA TAGGCAAAGC GGTTATTGAG CGGCTGCGAG GATTTGGATG TAAAGTGTTG 4 08 0 



GCTTATAGTC 


GCAGCCGAAG 


TATAGAGGTA 


AACTATGTAC 


CGTTTGATGA 


GTTGATGCAA 


4140 


AATAGCGATA 


TCGTTACGCT 


TCATGTGCCG 


CTCAATACGG 


ATACGCACTA 


TATTATCAGC 


4200 


CACGAACAAA 


TACAGAGAAT 


GAAGCAAGGA 


GCATTTCTTA 


TCAATACTGG 


GCGCGGTCCA 


4260 


CTTGTAGATA 


CCTATGAGTT 


GGTTAAAGCA 


TTAGAAAACG 


GGAAACTGGG 


CGGTGCCGCA 


4320 


TTGGATGTAT 


TGGAAGGAGA 


GGAAGAGTTT 


TTCTACTCTG 


ATTGCACCCA 


AAAACCAATT 


4380 


GATAATCAAT 


TTTTACTTAA 


ACTTCAAAGA 


ATGCCTAACG 


TGATAATCAC 


ACCGCATACG 


4440 


GCCTATTATA 


CCGAGCAAGC 


GTTGCGTGAT 


ACCGTTGAAA 


AAACCATTAA 


AAACTGTTTG 


4500 


GATTTTGAAA 


GGAGACAGGA 


GCATGAATAG 


AATAAAAGTT 


GCAATACTGT 


TTGGGGGTTG 


4560 


CTCAGAGGAG 


CATGACGTAT 


CGGTAAAATC 


TGCAATAGAG 


ATAGCCGCTA 


ACATTAATAA 


4620 


AGAAAAATAC 


GAGCCGTTAT 


ACATTGGAAT 


TACGAAATCT 


GGTGTATGGA 


AAATGTGCGA 


4680 


AAAACCTTGC 


GCGGAATGGG 


AAAACGACAA 


TTGCTATTCA 


GCTGTACTCT 


CGCCGGATAA 


4740 


AAAAATGCAC 


GGATTACTTG 


TTAAAAAGAA 


CCATGAATAT 


GAAATCAACC 


ATGTTGATGT 


4800 


AGCATTTTCA 


GCTTTGCATG 


GCAAGTCAGG 


TGAAGATGGA 


TCCATACAAG 


GTCTGTTTGA 


4860 


ATTGTCCGGT 


ATCCCTTTTG 


TAGGCTGCGA 


TATTCAAAGC 


TCAGCAATTT 


GTATGGACAA 


4920 


ATCGTTGACA 


TACATCGTTG 


CGAAAAATGC 


TGGGATAGCT 


ACTCCCGCCT 


TTTGGGTTAT 


4980 


TAATAAAGAT 


GATAGGCCGG 


TGGCAGCTAC 


GTTTACCTAT 


CCTGTTTTTG 


TTAAGCCGGC 


5040 


GCGTTCAGGC 


TCATCCTTCG 


GTGTGAAAAA 


AGTCAATAGC 


GCGGACGAAT 


TGGACTACGC 


5100 


AATTGAATCG 


GCAAGACAAT 


ATGACAGCAA 


AATCTTAATT 


GAGCAGGCTG 


TTTCGGGCTG 


5160 


TGAGGTCGGT 


TGTGCGGTAT 


TGGGAAACAG 


TGCCGCGTTA 


GTTGTTGGCG 


AGGTGG AC C A 


5220 


AATCAGGCTG 


CAGTACGGAA 


TCTTTCGTAT 


TCATCAGGAA 


GTCGAGCCGG 


AAAAAGGCTC 


5280 


TGAAAACGCA 


GTTATAACCG 


TTCCCGCAGA 


CCTTTCAGCA 


GAGGAGCGAG 


GACGGATACA 


5340 


GGAAACGGCA 


AAAAAAATAT 


ATAAAGCGCT 


CGGCTGTAGA 


GGTCTAGCCC 


GTGTGGATAT 


5400 


GTTTTTACAA 


GATAACGGCC 


GCATTGTACT 


GAACGAAGTC 


AATACTCTGC 


CCGGTTTCAC 


5460 


GTCATACAGT 


CGTTATCCCC 


GTATGATGGC 


CGCTGCAGGT 


ATTGCACTTC 


CCGAACTGAT 


5520 


TGACCGCTTG 


ATCGTATTAG 


CGTTAAAGGG 


GTGATAAGCA 


TGGAAATAGG 


ATTTACTTTT 


5580 


TTAGATGAAA 


TAGTACACGG 


TGTTCGTTGG 


GACGCTAAAT 


ATGCCACTTG 


GGATAATTTC 


5640 


ACCGGAAAAC 


CGGTTGACGG 


TTATGAAGTA 


AATCGCATTG 


TAGGGACATA 


CGAGTTGGCT 


5700 


GAATCGCTTT 


TGAAGGCAAA 


AGAACTGGCT 


GCTACCCAAG 


GGTACGGATT 


GCTTCTATGG 


5760 


GACGGTTACC 


GTCCTAAGCG 


TGCTGTAAAC 


TGTTTTATGC 


AATGGGCTGC 


ACAGCCGGAA 


5820 



A ATA ACCTGA 
. nn J. nnv^u x wn 


CAAAGGAAAG 


TT ATT AT C C C 


AATATTGACC 


GAACTGAGAT 


GATTTCAAAA 

VJxx XXX viUxfxxl 


58 R n 


OUn X n ww J- ww 


CTTCAAAATC 


AAG C CAT AG C 

XXXX VJ V«» V^XX A XX %J 


CGCGGCAGTG 

V»- Vj V^VJVj V^xiVJ X >J 


CCATTGATCT 

V«> V^xx X X Vjxx X V-» X 


T APGPTTT AT 
invfVJvi x ini 


5940 


wwAX J. Aw ri w a 


CGGGTGAGCT 

V_* Vj VJVJ X w«w V X 


TGTACCAATG 

X ^J X XX V^ N^**x* ^ VJ 


GGGAGCCGAT 


TTGATTTTAT 

x x vjn x x x x XX x 


GGATGA A PGP 
wwxrvx onnv«u\> 


6000 


1 vl wn X w A X w 


PGGCAAATGG 

wwwwn^^nx vjvj 


AATATCATGC 

r\n x xx x v»»xx x vj v»- 


AATGAAGCGC 

nni wn-nwwww 


AAAATCGCAG 

nrmn x w w v^avj 


APGTTTGPGP 

Aw w X X X w w w w 


6060 


X W w A X ^rt X VJVJ 


AAAACAGTGG 

nnnnvnw x vj vj 


GTTTGAAGCA 

VJ X X A W«i*i\J x*f*XX 


TATAGCCTCG 

X XX X nw >■» X VVJ 


AATGGTGGCA 

XX-XX ^ W X VJV3 XX 


CTATGTATTA 

W X xl X w X x^. X X xx 


612 0 


A C A G A PG AAf 


PATAPPCCAA 


TAGCTATTTT 

x nww x xx x x x x 


GATTTCCCCG 

vjn x x x vVufV^vvj 


TT AAAT AAA C 


TTTTAACCGT 


6180 


fpGrArGGAPA 


A APT AT AT A A 
nn w x n x n x An 


GPTAACTCTT 

w\« x nnv x x x 


TCGGCAGG AA 

x w\jw vAwunn 


ACCCGACGTA 

XX ^J XX >J 1 X X 


TGTAACTGGT 

x vj x nn w x vj vj x 


6240 


TPTTAC^GAA 


TTT AT AT AT A 

XXX -A x n X xx x n, 


GTAGATAGTA 

vj x nvj/* x xxvj x xx 


TTGAAGATGT 

x x wri/iVjri x w x 


AAGG C AG AG C 


GATATTGCGG 

nJxx x. x^. X X v7Ww 


6300 


T P A T T A T P T G 
X w A x lnl^lw 


WW X WW WW X WW 


GGCAAGATAG 


CCTGATAATA 

w w X VJA X xxxx X x^. 


AGACTGATCG 

XxVJxx V— X VJxX X W 


CATAGAGGGG 

>— »xx X xxvJxxVJ VJ VJ VJ 


6360 


X ww Ini. 1 1 wA 


PAPPGPPPAT 

wn W WW w w w^A X 


TGTPAAPAGG 
x w x v«nn w n.w w 


PAGTTPAGPP 

vAVJ X X V/ivJ W W 


TCGTTAAATT 

XwVJX X xxxxxx A. X 


CAGCATGGGT 

v-oxxVJ V_»xx X Vj Vj vj X 


6420 


7\fpr a ptt a tcz 

n X wn w X X n X w 


A A A ATTPATP 
nnAA X X w A x w 


TAPATTGGTG 

XAvAl X w w X Vj 


ATAATAGTAA 
c\ x x *ri w x nn 


ATCCAGTAGG 

xx x w vnvj x x^.w vj 


GCGAAATAAT 

vj vvnnn x xx^x x 


6480 


TGAPTCTAAT 
X unL X Vj X nn X 


ttapggggpa 

X XAwwwVjwwA 


AAACGGCACA 

AAAw \JVJ wAwA 


ATCTPAAACG 
xx x w x \»nnnvvj 


AGATTGTGCC 

XxVJxx X l\71wvV 


GTTTAAGGGG 

VJ X X -X. XiXxVJ VJ VJ VJ 


6540 


AACATTPTAG 
nnbn X X w X Aw 


A A ATATTTPA 

Ann X A XXX wA 


TACTTCCAAC 

X^xwX 1 vvAAv 


TATATAGTTA 

x x\ x n x Aw x x n 


AGGAGGAGAC 


TGAAAATGAA 

•A. VJ XX_X XX XXX -X, VJ Xxx X 


6600 


G A A GTTGTTT 
unxio X X \j X XX 


TTTTT ATTGT 

X X X X X A X X w X 


TATTGTTATT 

X xx X X w X X xx X X 


CTTAATATAC 


TTAGGTTATG 

X X ZiVJVJ X X xx X w 


ACTACGTTAA 

nwxnvwx -L. xxxx 


6660 


tga agpaptg 

X vjnnwwnw X Vj 


TTTTPTPAGG 

X X X X w X \*A\jw 


AAAAAGTCGA 

fU^AXVAw X ww.^x 


ATTTCAAAAT 

n xxx w fuum x 


TATGATCAAA 

x XX x vjxx x vfinn 


ATCCCAAAGA 

X X .X. Vi> V^.» ViliUiWl X 


6720 


APATTTAGAA 
nv»n x x x nvj/in 


AATAGTGGGA 

xxxx x r^w x vj vj 


CTTCTGAAAA 

w- X X >v X 


TACCCAAGAG 


AAAACAATTA 


CAGAAGAACA 

Vp^XX VJ XXXX VJ XXXX V* XX 


6780 


GGTTTATCAA 

VJ VJ XXX xx X vilfl 


GGAAATCTGC 

WJfWl X >w X VJ >•»» 


TATTAATCAA 

x xx x x nn x vxxn 


TAGTAAATAT 

X xx VJ x fUua J. XX ^ 


CCTGTTCGCC 


AAGAAGTGTG 


6840 


AAGTPAGATA 

fUlVJ X V_,xT.vj.n. X xx 


TCGTGAATTT 

x ^> vj x vjxaxx xxx 


ATCTAAACAT 

xx x w- x nnnun x 


GACGAATTAA 

vjnv»vjiin x inn 


TAAATGGATA 

A. XXXXXX X \J \JXX L XX 


CGGGTTGCTT 

V^ VJ VJ VJ x 1 w V .X- L 


6900 


GATAGTAATA 
x rivj x rin x xx 


TTTATATGTP 

XXX ^\ X xx X Vj X >■•» 


AAA AG A A ATA 


GPAPAAAAAT 


TTTCAGAGAT 


GGT C AATG AT 

vj VJ V>^» xxxx X VJxx X 


6960 


GCTGTAAAGG 

ww x w x rvfxf^vjvj 


GTGGCGTTAG 

Vj X Vj VJ V^ VJ X X XiVJ 


TCATTTTATT 
x wn. x x x x x^ x x 


ATTAATAGTG 

x* x x .n-n. x xx vj x w 


GCT AT CG AG A 

W Vp* x xx x xjxx Vm# Xx 


CTTTGATGAG 

v»- xxx VJxx x vjxx vj 


7020 


CAAAGTGTGC 


TTTACCAAGA 


AATGGGGGCT 


GAGTATGCCT 


TACCAGCAGG 


TTATAGTGAG 


7080 


CATAATTCAG 


GTTTATCACT 


AGATGTAGGA 


TCAAGCTTGA 


CGAAAATGGA 


ACGAGCCCCT 


7140 


GAAGGAAAGT 


GGATAGAAGA 


AAATGCTTGG 


AAATACGGGT 


TCATTTTACG 


TTATCCAGAG 


7200 


GACAAAACAG 


AGTTAACAGG 


AATTC 








7225 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



GGGGTAGCGT 


CAGGAAAATG 


CGGATTTACA 


ACGCTAAGCC 


TATTTTCCTG 


ACGAATCCCT 


60 


CGTTTTTAAC 


AACGTTAAGA 


AAGTTTTAGT 


GGTCTTAAAG 


AATTTAATGA 


GACTACTTTC 


120 


TCTGAGTTAA 


AATGGT ATT C 


TCCTAGTAAA 


TTAATATGTT 


CCCAACCTAA 


GGGCGACATA 


180 


TGGTGTAACA 


AATCTTCATT 


AAAGCTACCT 


GTCCGTTTTT 


TATATTCAAC 


TGCTGTTGTT 


240 


AGGTGGAGAG 


TATTCCAAAT 


ACTTATAGCA 


TTGATAATTA 


TGTTTAAAGC 


ACTGGCTCTT 


300 


TGCAATTGAT 


GCTGTATGGT 


GCGTTCTCTA 


AGCTCACCTT 


GTTTTCCGAA 


GAAAATAGCT 


360 


» CTTGCCAATC 


CATTCATGGC 


TTCTCCTTTA 


TTCAATCCTC 


TTTGTATTTT 


TCTTCTTAAT 


420 


* GATTCATCCG 


ATATATAATT 


CAAAATAAAG 


ATCGTTTTTT 


CTATTCGGCC 


CATCTCACGT 


480 


AAGGCTGTAG 


CTAAGCTGTT 


TTGTCTTGAA 


TAGGAACCTA 


GCTTCCCCAT 


AATAAGGGAT 


540 


G CTG AAACTG 


TTCCCTCCCT 


TATAGAATGA 


GCTAATCGCA 


AAACATCCTC 


ATAATTTTCT 


600 


TTAATGACCT 


TTGTATTTAT 


TTGTCCACGT 


AAAATGGCTT 


CTAGTTTTGG 


ATACTCACTT 


660 


GCTTTATCTA 


TCGTAAATAA 


TTTTGAGTCC 


GATAAATCCC 


TTATTCTTGG 


GGCAAATTTA 


720 


AATCCTAATA 


AATGAGTCAG 


TCCGAATATT 


TGGTCAGTGT 


AACCGGCAGT 


GTCTGTATAA 


780 


TGTTCCTCTA 


TGTTTAGATC 


CGTCTCATGA 


TGTAACAAAC 


CATCCAAAAC 


ATGAATCGCA 


840 


TCTCTTGAAT 


TAGTATGAAT 


AATCTTTGTG 


TAGTAAGAAG 


AGAATTGATC 


ACTTGTAAAT 


900 


CGGTAGATGG 


TGGCTCCTTT 


TCCAGTTCCA 


TAATGTGGAT 


TTGCATCTGC 


ATGTAGTGAT 


960 


GAAACACCTA 


GCTGCATTCT 


CATACCATCT 


GACGAAGATG 


TTGTACCGTC 


GCCCCAATAG 


1020 


AAAGGCAATT 


GTAATTTATG 


ATGAAAGTTT 


ACTAATATGG 


CTTGGGCTTT 


ATTCATGGCA 


1080 


TCTTCATACA 


TGCGCCATTG 


AGATACATTG 


GCTAGTTGCT 


TATATGTAAG 


TCCGGGTGTG 


1140 


GCTTCGGCCA 


TCTTGCTCAA 


GCCAATATTC 


ATTCCCATTC 


CTAAAAGGGC 


AGCCATGATA 


1200 


ATGATTGTTT 


CTTCCTTATC 


TGGTTTTCGA 


TTATTGGAAG 


CATGAGTGAA 


TTGCTCATGA 


1260 


AATCCTGTTA 


TATGGGCCAC 


ATCCATGAGT 


7v t\ 7\ m /*» tv mm tv 

AAAT CAGTTA 


Al 1 1 1L1 




i o o n 

1 J z u 


TGATAAAGGC 


TTGCACTAAA 


TTTTTTTGCT 


TCTTCTGGAA 


CATCTTTTTC 


TAAGCGTGCA 


1380 


AGTGATAGCT 


TTCCTTTTTC 


AAGAGAAACC 


CCATCTAACT 


TATTGGAATT 


GGCAGCTAAC 


1440 


CACTTTAACC 


TTT C ATT AAA 


GCTGCTGGTT 


CTCTCCGTTA 


TATAATCTTC 


GAATGATAAA 


1500 


CTAACTGATA 


ATCTCGTATT 


CCCCTTCGAT 


TGATTCCATG 


TATCTTCCGA 


AAACAAATAT 


1560 


TCCTCAAAAT 


CCCTATATTG 


TCTGCTGCCA 


ACAATGGAAA 


CATCTCCTGC 


CCGAACATGC 


1620 



TCCCGAAGTT CTGTTAAAAC AGCCATTTCA TAGTAATGAC GATTAATTGT TGTACCATCA 168 0 

TCCTCGTATA AATGTCTTTT CCATCGTTTT GAAATAAAAT CCACAGGTGA GTCATCAGGC 17 4 0 

ACTTTTCGCT TTCCAGATTC GTTCATTCCT CGGATAATCT CAACAGCTTG TAAAAGTGGC 18 00 

TCATTTGCCT TTGTAGAATG AAATTCCAAT ACTCTTAATA GCGTTGGCGT AT ATTTT CTT 18 60 

AGTGAATAAA ACCGTTTTTG CAGTAAGTCT AAATAATCAT AGTCGGCAGG ACGTGCAAGT 192 0 

TCCTGAGCCT CTTCTACTGA AGAGACAAAG GTATTCCATT CAATAACCGA TTCTAAAACC 198 0 

TTAAAAACGT CTAATTTTTC CTCTCTTGCT TTAATTAATG CTTGTCCGAT GTTCGTAAAG 2 04 0 

TGTATAACTT TCTCATTTAG CTTTTTACCG TTTTGTTTCT GGATTTCCTC TTGAGCCTTA 2100 

m CGACCTTTTG ATAACAAACT AAGTATTTGC CTATCATGAA TTTCAAACGC TTTATCCGTT 2160 

* AGCTCCTGAG TAAGTTGTAA TAAATAGATG GTTAATATCG AATAACGTTT ATTTTCTTGA 22 2 0 

AAGTCACGGA ATGCATACGG CTCGTATCTT GAGCCTAAGC GAGACAGCTG CAACAGGCGG 2280 

TTACGGTGCA AATGACTAAT TTGCACTGTT TCTAAATCCA TTCCTCGTAT GTATTCGAGT 2 3 40 

CGTTCTATTA TTTTTAGAAA AGTTTCGGGT GAAGGATGAC CCGGTGGCTC TTTTAACCAA 24 00 

CCCAATATCG TTTTATTGGA TTCGGATGGA TGCTGCGAGG TAATAATCCC TTCAAGCTTT 24 60 

TCTTTTTGCT CATTTGTTAG AGATTTACTA ACCGTATTAA ATAGCTTCTT TTCAGCCATT 2 52 0 

GCCCTTGCTT CCCACACCAT TCTTTCAAGT GTAGTGATAG CAGGCAGTAT AATTTTGTTT 2 58 0 

TTTCTTAGAA AATCTATGCA TTCATGCAGT AGATGAATGG CATCACCATT TTCCAAAGCT 2 64 0 

AATTGATGAA GGTACTTAAA TGTCATTCGA TATTCACTCA GGGTAAAAGT TACAAAGTCG 27 00 

TATTCACTTC GAATTTCTTT CAAATGATCC CAAAGTGTAT TTTCCCTTTG AGGATAATGA 27 60 

TCAAGCGAGG ATGGACTAAC ACCAATCTGT TTCGATATAT ATTGTATGAC CGAATCTGGG 282 0 

ATG CTTTTGA TATGAGTGTA TGGCCAACCG GGATACCGAA GAACAGCTAA TTGAACAGCA 288 0 

AATCCTAAAC GGTTTTCTTC CCTCCTTCGC TTATTAACTA TTTCTAAATC CCGTTTGGAA 294 0 

AAAGTGAAGT AGGTCCCCAG TATCCATTCA TCTTCAGGGA TTTGCATAAA AGCCTGTCTC 3 000 

TGTTCCGGTG TAAGCAATTC TCTACCTCTC GCAATTTTCA TTCAGTATCA TTCCATTTCT 3 060 

GTATTTTCAA TTTATTAGTT CAATTATATA TCAATAGAGT GTACTCTATT GATACAAATG 312 0 

TAGTAGACTG ATAAAATCAT AGTTAAGAGC GTCTCATAAG ACTTGTCTCA AAAATGAGGT 318 0 

GATATTTTGC GGAAAATCGG TTATATTCGT GTCAGTTCGA CTAACCAGAA TCCTTCAAGA 3 24 0 

CAATTTCAGC AGTTGAACGA GATCGGAATG GATATTATAT ATGAAGAGAA AGTTTCAGGA 3 3 00 

GCAACAAAGG ATCGCGAGCA ACTTCAAAAA GTGTTAGACG ATTTACAGGA AGATGACATC 3 3 60 



ATTTATGTTA CAGACTTAAC TCGAATCACT CGTAGTACAC AAGATCTATT TGAATTAATC 34 2 0 

GATAACATAC GAGATAAAAA GGCAAGTTTA AAATCACTAA AAGATACATG GCTTGATTTA 348 0 

TCAGAAGATA ATCCATACAG CCAATTCTTA ATTACTGTAA TGGCTGGTGT TAACCAATTA 3 54 0 

GAGCGAGATC TTATTCGGAT GAGACAACGT GAAGGGATTG AATTGGCTAA GAAAGAAGGA 3 60 0 

AAGTTTAAAG GTCGATTAAA GAAGTATCAT AAAAATCACG CAGGAATGAA TTATGCGGTA 3 660 

AAGCTATATA AAGAAGGAAA TATGACTGTA AATCAAATTT GTGAAATTAC TAATGTATCT 37 2 0 

AGGGCTTCAT TATACAGGAA ATTATCAGAA GTGAATAATT AGCCATTCTG TATTCCGCTA 3780 

ATGGGCAATA TTTTTAAAGA AGAAAAGGAA ACTATAAAAT ATTAACAGCC T C CT AG CG AT 3840 

GCCGAAAAGC CCTTTGATAA AAAAAGAATC ATCATCTTAA GAAATTCTTA GTCATTTATT 3 9 00 

ATGTAAATGC TTATAAATTC GGCCCTATAA TCTGATAAAT TATTAAGGGC AAACTTATGT 3 9 60 

GAAAGGGTGA TAACTATGAG CGATAAAATA CTTATTGTGG ATGATGAACA TGAAATTGCC 4 02 0 

GATTTGGTTG AATTATACTT AAAAAACGAG AATTATACGG TTTTCAAATA CTATACCGCC 4 080 

AAAGAAGCAT TGGAATGTAT AGACAAGTCT GAGATTGACC TTGCCATATT GGACATCATG 414 0 

CTTCCCGGCA CAAGCGGCCT TACTATCTGT CAAAAAATAA GGGACAAGCA CACCTATCCG 4 2 00 

ATTATCATGC TGACCGGGAA AGATACAGAG GTAGATAAAA TTACAGGGTT AACAATCGGC 4 2 60 

GCGGATGATT ATATAACGAA GCCCTTTCGC CCACTGGAGT TAATTGCTCG GGTAAAGGCC 432 0 

CAGTTGCGCC GATACAAAAA ATTCAGTGGA GTAAAGGAGC AGAACGAAAA TGTTATCGTC 4 380 

CACTCCGGCC TTGTCATTAA TGTTAACACC CATGAGTGTT ATCTGAACGA GAAGCAGTTA 4440 

TCCCTTACTC CCACCGAGTT TTCAATACTG CGAATCCTCT GTGAAAACAA GGGGAATGTG 4 500 

GTTAGCTCCG AGCTGCTATT TCATGAGATA TGGGGCGACG AATATTTCAG CAAGAGCAAC 4 560 

AACACCATCA CCGTGCATAT CCGGCATTTG CGCGAAAAAA TGAACGACAC CATTGATAAT 4 62 0 

CCGAAATATA TAAAAACGGT ATGGGGGGTT GGTTATAAAA TTGAAAAATA AAAAAAACGA 4 680 

CTATTCCAAA CTAGAACGAA AACTTTACAT GTATATCGTT GCAATTGTTG TGGTAGCAAT 4 74 0 

TGTATTCGTG TTGTATATTC GTTCAATGAT CCGAGGGAAA CTTGGGGATT GGATCTTAAG 4 8 00 

TATTTTGGAA AACAAATATG ACTTAAATCA CCTGGACGCG ATGAAATTAT ATCAATATTC 4 8 60 

CATACGGAAC AATATAGATA TCTTTATTTA TGTGGCGATT GTCATTAGTA TTCTTATTCT 4920 

ATGTCGCGTC ATGCTTTCAA AATTCGCAAA ATACTTTGAC GAGATAAATA CCGGCATTGA 4 980 

TGTACTTATT CAGAACGAAG AT AAA C AAAT TGAGCTTTCT GCGGAAATGG ATGTTATGGA 5 04 0 

ACAAAAGCTC AACACATTAA AACGGACTCT GGAAAAGCGA GAGCAGGATG CAAAGCTGGC 5100 



CGAACAAAGA AAAAATGACG TTGTTATGTA CTTGGCGCAC GATATTAAAA CGCCCCTTAC 5160 

ATCCATTATC GGTTATTTGA GCCTGCTTGA CGAGGCTCCA GACATGCCGG TAGATCAAAA 52 2 0 

GGCAAAGTAT GTGCATATCA CGTTGGACAA AGCGTATCGA CTCGAACAGC TAATCGACGA 528 0 

GTTTTTTGAG ATTACACGGT ATAACCTACA AACGATAACG CTAACAAAAA CGCACATAGA 53 40 

CCTATACTAT ATGCTGGTGC AG ATG AC CG A TGAATTTTAT CCTCAGCTTT CCGCACATGG 54 00 

AAAACAGGCG GTTATTCACG CCCCCGAGGA TCTGACCGTG TCCGGCGACC CTGATAAACT 54 60 

CGCGAGAGTC TTTAACAACA TTTTGAAAAA CGCCGCTGCA TACAGTGAGG ATAACAGCAT 552 0 

CATTGACATT ACCGCGGGCC TCTCCGGGGA TGTGGTGTCA ATCGAATTCA AGAACACTGG 5580 

AAGCATCCCA AAAGATAAGC TAGCTGCCAT ATTTGAAAAG TTCTATAGGC TGGACAATGC 5640 

TCGTTCTTCC GATACGGGTG GCGCGGGACT TGGATTGGCG ATTGCAAAAG AAATTATTGT 57 00 

TCAGCATGGA GGGCAGATTT ACGCGGAAAG CAATGATAAC TATACGACGT TTAGGGTAGA 57 60 

GCTTCCAGCG ATGCCAGACT TGGTTGATAA AAGGAGGTCC TAAGAGATGT ATATAATTTT 582 0 

TTAGGAAAAT CTCAAGGTTA TCTTTACTTT TTCTTAGGAA ATTAACAATT TAATATTAAG 58 80 

AAACGGCTCG TTCTTACACG GTAGACTTAA TACCGTAAGA ACGAGCCGTT TTCGTTCTTC 594 0 

AGAGAAAGAT TTGACAAGAT TACCATTGGC ATCCCCGTTT TATTTGGTGC CTTTCACAGA 6000 

AAGGGTTGGT CTTAATTATG AATAACATCG GCATTACTGT TTATGGATGT GAGCAGGATG 6060 

AGGCAGATGC ATTCCATGCT CTTTCGCCTC GCTTTGGCGT TATGGCAACG ATAATTAACG 612 0 

CCAACGTGTC GGAATCCAAC GCCAAATCCG CGCCTTTCAA TCAATGTATC AGTGTGGGAC 6180 

ATAAATCAGA GATTTCCGCC TCTATTCTTC TTGCGCTGAA GAGAGCCGGT GTGAAATATA 624 0 

TTTCTACCCG AAGCATCGGC TGCAATCATA TAGATACAAC TGCTGCTAAG AGAATGGGCA 63 00 

TCACTGTCGA CAATGTGGCG TACTCGCCGG ATAGCGTTGC CGATTATACT ATG ATG CT AA 63 60 

TTCTTATGGC AGTACGCAAC GTAAAATCGA TTGTGCGCTC TGTGGAAAAA CATGATTTCA 642 0 

GGTTGGACAG CGACCGTGGC AAGGTACTCA GCGACATGAC AGTTGGTGTG GTGGGAACGG 6480 

G C C AG AT AGG CAAAGCGGTT ATTGAGCGGC TGCGAGGATT TGGATGTAAA GTGTTGGCTT 654 0 

ATAGTCGCAG CCGAAGTATA GAGGTAAACT ATGTACCGTT TGATGAGTTG CTGCAAAATA 6600 

GCGATATCGT TACGCTTCAT GTGCCGCTCA ATACGGATAC GCACTATATT ATCAGCCACG 6660 

AACAAATACA GAGAATGAAG CAAGGAGCAT TTCTTATCAA TACTGGGCGC GGTCCACTTG 67 2 0 

TAGATACCTA TGAGTTGGTT AAAG C ATT AG AAAACGGGAA ACTGGGCGGT GCCGCATTGG 67 80 

ATGTATTGGA AGGAGAGGAA GAGTTTTTCT ACTCTGATTG CACCCAAAAA CCAATTGATA 68 4 0 



• ATCAATTTTT ACTTAAACTT CAAAGAATGC CTAACGTGAT AATCACACCG CATACGGCCT 69 00 

ATT AT AC CG A GCAAGCGTTG CGTGATACCG TTGAAAAAAC CATTAAAAAC TGTTTGGATT 69 60 

TTGAAAGGAG ACAGGAGCAT GAATAGAATA AAAGTTGCAA TACTGTTTGG GGGTTGCTCA 7 02 0 

GAGGAGCATG ACGTATCGGT AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA 7 080 

AAATACGAGC CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT GTGCGAAAAA 714 0 

CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG TACTCTCGCC GGATAAAAAA 72 00 

ATGCACGGAT TACTTGTTAA AAAGAACCAT GAATATGAAA TCAACCATGT TGATGTAGCA 7 2 60 

TTTTCAGCTT TGCATGGCAA GTCAGGTGAA GATGGATCCA TACAAGGTCT GTTTGAATTG 732 0 

TCCGGTATCC CTTTTGTAGG CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG 7 3 80 

TTGACATACA TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG GGTTATTAAT 74 4 0 

AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG TTTTTGTTAA GCCGGCGCGT 7 500 

TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC AATAGCGCGG ACGAATTGGA CTACGCAATT 7 560 

GAATCGGCAA GACAATATGA CAGCAAAATC TTAATTGAGC AGGCTGTTTC GGGCTGTGAG 7 620 

GTCGGTTGTG CGGTATTGGG AAACAGTGCC GCGTTAGTTG TTGGCGAGGT GGACCAAATC 7 680 

AGGCTGCAGT ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA AGGCTCTGAA 774 0 

AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG AGCGAGGACG GATACAGGAA 7 8 00 

ACGGCAAAAA AAATATATAA AGCGCTCGGC TGTAGAGGTC TAGCCCGTGT GGATATGTTT 7 860 

TTACAAGATA ACGGCCGCAT TGTACTGAAC GAAGTCAATA CTCTGCCCGG TTTCACGTCA 7 92 0 

TACAGTCGTT ATCCCCGTAT GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC 7 98 0 

CGCTTGATCG TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT ACTTTTTTAG 8 04 0 

ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC CACTTGGGAT AATTTCACCG 8100 

GAAAACCGGT TGACGGTTAT GAAGTAAATC GCATTGTAGG GACATACGAG TTGGCTGAAT 8160 

CGCTTTTGAA GGCAAAAGAA CTGGCTGCTA CCCAAGGGTA CGGATTGCTT CTATGGGACG 82 2 0 

GTTACCGTCC TAAGCGTGCT GTAAACTGTT TTATGCAATG GGCTGCACAG CCGGAAAATA 82 8 0 

ACCTGACAAA GGAAAGTTAT TATCCCAATA TTGACCGAAC TGAGATGATT T C AAAAGG AT 83 4 0 

ACGTGGCTTC AAAATCAAGC CATAGCCGCG GCAGTGCCAT TGATCTTACG CTTTATCGAT 84 00 

TAGACACGGG TGAGCTTGTA CCAATGGGGA GCCGATTTGA TTTTATGGAT GAACGCTCTC 84 60 

ATCATGCGGC AAATGGAATA TCATGCAATG AAGCGCAAAA TCGCAGACGT TTGCGCTCCA 8 52 0 

TCATGGAAAA CAGTGGGTTT GAAGCATATA GCCTCGAATG GTGGCACTAT GTATTAAGAG 858 0 



ACGAACCATA CCCCAATAGC TATTTTGATT TCCCCGTTAA ATAAACTTTT AACCGTTGCA 8 64 0 

CGGACAAACT ATATAAGCTA ACTCTTTCGG CAGGAAACCC GACGTATGTA ACTGGTTCTT 87 00 

AGGGAATTTA TATATAGTAG ATAGTATTGA AGATGTAAGG CAGAGCGATA TTGCGGTCAT 87 60 

TATCTGCGTG CGCTGCGGCA AGATAGCCTG ATAATAAGAC TGATCGCATA GAGGGGTGGT 88 20 

ATTTCACACC GCCCATTGTC AACAGGCAGT TCAGCCTCGT TAAATTCAGC ATGGGTATCA 88 80 

CTTATGAAAA TTCATCTACA TTGGTGATAA TAGTAAATCC AGTAGGGCGA AATAATTGAC 894 0 

TGTAATTTAC GGGGCAAAAC GGCACAATCT CAAACGAGAT TGTGCCGTTT AAGGGGAAGA 9000 

TTCTAGAAAT ATTTCATACT TCCAACTATA TAGTTAAGGA GGAGACTGAA AATGAAGAAG 9060 

TTGTTTTTTT TATTGTTATT GTTATTCTTA ATATACTTAG GTTATGACTA CGTTAATGAA 912 0 

GCACTGTTTT CTCAGGAAAA AGTCGAATTT CAAAATTATG ATCAAAATCC CAAAGAACAT 918 0 

TTAGAAAATA GTGGGACTTC TGAAAATACC CAAGAGAAAA CAATTACAGA AGAACAGGTT 92 4 0 

TATCAAGGAA ATCTGCTATT AATCAATAGT AAATATCCTG TTCGCCAAGA AAGTGTGAAG 93 00 

TCAGATATCG TGAATTTATC TAAACATGAC GAATTAATAA ATGGATACGG GTTGCTTGAT 93 60 

AGTAATATTT ATATGTCAAA AGAAATAGCA CAAAAATTTT CAGAGATGGT CAATGATGCT 94 2 0 

GTAAAGGGTG GCGTTAGTCA TTTTATTATT AATAGTGGCT ATCGAGACTT TGATGAGCAA 94 8 0 

AGTGTGCTTT ACCAAGAAAT GGGGGCTGAG TATGCCTTAC CAGCAGGTTA TAGTGAGCAT 9 54 0 

AATTCAGGTT TATCACTAGA TGTAGGATCA AGCTTGACGA AAATGGAACG AGCCCCTGAA 9 600 

GGAAAGTGGA TAGAAGAAAA TGCTTGGAAA TACGGGTTCA TTTTACGTTA TCCAGAGGAC 9660 

AAAACAGAGT TAACAGGAAT TCAATATGAA CCATGGCATA TTCGCTATGT TGGTTTACCA 97 2 0 

CATAGTGCGA TTATGAAAGA AAAGAATTTC GTTCTCGAGG AATATATGGA TTACCTAAAA 9780 

GAAGAAAAAA CCATTTCTGT TAGTGTAAAT GGGGAAAAAT ATG AG AT CTT TTATTATCCT 984 0 

GTTACTAAAA ATACCACCAT TCATGTGCCG ACTAATCTTC GTTATGAGAT ATCAGGAAAC 99 00 

AATATAGACG GTGTAATTGT GACAGTGTTT CCCGGATCAA CACATACTAA TTCAAGGAGG 99 60 

TAAGGATGGC GGAATGAAAC CAACGAAATT AATGAACAGC ATTATTGTAC TAGCACTTTT 100 2 0 

GGGGTAACGT TAGCTTTTTA ATTTAAAACC CACGTTAACT AGGACATTGC TATACTAATG 1008 0 

ATACAACTTA AACAAAAGAA TTAGAGGAAA TTATATTGGG AAAAATATTA TCTAGAGGAT 1014 0 

TGCTAGCTTT ATATTTAGTG ACACTAATCT GGTTAGTGTT ATTCAAATTA CAATACAATA 102 00 

TTTTATCAGT ATTTAATTAT CAT C AAAG AA GTCTTAACTT GACTCCATTT ACTGCTACTG 102 60 

GGAATTTCAG AGAGATGATA GATAATGTTA TAATCTTTAT TCCATTTGGC TTG CTTTTG A 103 2 0 



-ATGTCAATTT TAAAGAAATC GGATTTTTAC CTAAGTTTGC TTTTGTACTG GTTTTAAGTC 1038 0 

TTACTTTTGA AATAATTCAA TTTATCTTCG CTATTGGAGC GACAGACATA ACAGATGTAA 10440 

TTACAAATAC TGTTGGAGGC TTTCTTGGAC TGAAATTATA TGGTTTAAGC AATAAGCATA 10500 

TGAATCAAAA AAAATTAGAC AGAGTTATTA TTTTTGTAGG TATACTTTTG CTCGTATTAT 10560 

TGCTCGTTTA CCGTACCCAT TTAAGAATAA ATTACGTGTA AGATGTCTAA ATCAAGCAAT 10620 

CTGATCTTTC ATACACATAA AGATATTGAA TGAATTGGAT TAGATGGAAA ACGGGATGTG 10680 

GGGAAACTCG CCCGTAGGTG TGAAGTGAGG GGAAAACCGG TGATAAAGTA AAAAGCTTAC 10740 

CTAACACTAT AGTAACAAAG AAAGCCCAAT TATCAATTTT AGTGCTGAGG AATTGGTCTC 108 00 

TTTAATAAAT TTCCTTAACG TTGTAAATCC GCATTTTCCT GACGGTACCC C 10851 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2667 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CAAGATTACC ATTGGCATCC CCGTTTTATT TGGTGCCTTT CACAGAAAGG GTTGGTCTTA 60 

ATTATGAATA ACATCGGCAT TACTGTTTAT GGATGTGAGC AGGATGAGGC AGATGCATTC 12 0 

CATGCTCTTT CGCCTCGCTT TGGCGTTATG GCAACGATAA TTAACGCCAA CGTGTCGGAA 18 0 

TCCAACGCCA AATCCGCGCC TTTCAATCAA TGTATCAGTG TGGGACATAA ATCAGAGATT 24 0 

TCCGCCTCTA TTCTTCTTGC GCTGAAGAGA GCCGGTGTGA AATATATTTC TACCCGAAGC 3 00 

ATCGGCTGCA ATCATATAGA TACAACTGCT GCTAAGAGAA TGGGCATCAC TGTCGACAAT 3 60 

GTGGCGTACT CGCCGGATAG CGTTGCCGAT TATACTATGA TGCTAATTCT TATGGCAGTA 42 0 

CGCAACGTAA AATCGATTGT GCGCTCTGTG GAAAAACATG ATTTCAGGTT GGACAGCGAC 48 0 

CGTGGCAAGG TACTCAGCGA CATGACAGTT GGTGTGGTGG GAACGGGCCA GATAGGCAAA 54 0 

GCGGTTATTG AGCGGCTGCG AGGATTTGGA TGTAAAGTGT TGGCTTATAG TCGCAGCCGA 600 

AGTATAGAGG TAAACTATGT ACCGTTTGAT GAGTTGATGC AAAATAGCGA TATCGTTACG 6 60 

CTTCATGTGC CGCTCAATAC GGATACGCAC TATATTATCA GCCACGAACA AATACAGAGA 720 

ATGAAGCAAG GAGCATTTCT TATCAATACT GGGCGCGGTC CACTTGTAGA TACCTATGAG 78 0 



-TTGGTTAAAG CATTAGAAAA CGGGAAACTG GGCGGTGCCG CATTGGATGT ATTGGAAGGA 84 0 

GAGGAAGAGT TTTTCTACTC TGATTGCACC CAAAAACCAA TTGATAATCA ATTTTTACTT 9 00 

AAACTTCAAA GAATGCCTAA CGTGATAATC ACACCGCATA CGGCCTATTA TACCGAGCAA 960 

GCGTTGCGTG ATACCGTTGA AAAAACCATT AAAAACTGTT TGGATTTTGA AAGGAGACAG 102 0 

GAGCATGAAT AGAATAAAAG TTGCAATACT GTTTGGGGGT TGCTCAGAGG AGCATGACGT 1080 

ATCGGTAAAA TCTGCAATAG AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT 114 0 

ATACATTGGA ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT GCGCGGAATG 12 00 

GGAAAACGAC AATTGCTATT CAGCTGTACT CTCGCCGGAT AAAAAAATGC ACGGATTACT 12 60 

TGTTAAAAAG AACCATGAAT ATGAAATCAA C C ATGTTG AT GTAGCATTTT CAGCTTTGCA 13 2 0 

TGGCAAGTCA GGTGAAGATG GATCCATACA AGGTCTGTTT GAATTGTCCG GTATCCCTTT 13 8 0 

TGTAGGCTGC GATATTCAAA GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT 14 4 0 

TGCGAAAAAT GCTGGGATAG CTACTCCCGC CTTTTGGGTT ATTAATAAAG ATGATAGGCC 1500 

GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG GCGCGTTCAG GCTCATCCTT 1560 

CGGTGTGAAA AAAGTCAATA GCGCGGACGA ATTGGACTAC GCAATTGAAT CGGCAAGACA 162 0 

ATATGACAGC AAAATCTTAA TTGAGCAGGC TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT 168 0 

ATTGGGAAAC AGTGCCGCGT TAGTTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG 17 4 0 

AATCTTTCGT ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG CAGTTATAAC 18 00 

CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA CAGGAAACGG CAAAAAAAAT 18 60 

ATATAAAGCG CTCGGCTGTA GAGGTCTAGC CCGTGTGGAT ATGTTTTTAC AAGATAACGG 192 0 

CCGCATTGTA CTGAACGAAG TCAATACTCT GCCCGGTTTC ACGTCATACA GTCGTTATCC 198 0 

CCGTATGATG GCCGCTGCAG GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT 2 04 0 

AGCGTTAAAG GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA AATAGTACAC 2100 

GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT TCACCGGAAA ACCGGTTGAC 2160 

GGTTATGAAG TAAATCGCAT TGTAGGGACA TACGAGTTGG CTGAATCGCT TTTGAAGGCA 222 0 

AAAGAACTGG CTGCTACCCA AGGGTACGGA TTGCTTCTAT GGGACGGTTA CCGTCCTAAG 2 2 80 

CGTGCTGTAA ACTGTTTTAT GCAATGGGCT GCACAGCCGG AAAATAACCT GACAAAGGAA 2 340 

AGTTATTATC CCAATATTGA CCGAACTGAG ATGATTTCAA AAGGATACGT GGCTTCAAAA 2 4 00 

TCAAGCCATA GCCGCGGCAG TGCCATTGAT CTTACGCTTT ATCGATTAGA CACGGGTGAG 2 4 60 

CTTGTACCAA TGGGGAGCCG ATTTGATTTT ATGGATGAAC GCTCTCATCA TGCGGCAAAT 2 52 0 



.GGAATATCAT GCAATGAAGC GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT 2 580 



GGGTTTGAAG CAT AT AG C CT CGAATGGTGG CACTATGTAT TAAGAGACGA ACCATACCCC 2 640 

AATAGCTATT TTGATTTCCC CGTTAAA 2 667 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2964 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : unknown 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2964 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATG AAA ATT GCG AGA GGT AGA GAA TTG CTT ACA CCG GAA CAG AGA CAG 48 
Met Lys lie Ala Arg Gly Arg Glu Leu Leu Thr Pro Glu Gin Arg Gin 
15 10 15 

GCT TTT ATG CAA ATC CCT GAA GAT GAA TGG ATA CTG GGG ACC TAC TTC 9 6 

Ala Phe Met Gin lie Pro Glu Asp Glu Trp lie Leu Gly Thr Tyr Phe 
20 25 30 

ACT TTT TCC AAA CGG GAT TTA GAA ATA GTT AAT AAG CGA AGG AGG GAA 144 
Thr Phe Ser Lys Arg Asp Leu Glu lie Val Asn Lys Arg Arg Arg Glu 
35 40 45 

GAA AAC CGT TTA GGA TTT GCT GTT CAA TTA GCT GTT CTT CGG TAT CCC 192 
Glu Asn Arg Leu Gly Phe Ala Val Gin Leu Ala Val Leu Arg Tyr Pro 
50 55 60 

GGT TGG CCA TAC ACT CAT ATC AAA AGC ATC CCA GAT TCG GTC ATA CAA 24 0 

Gly Trp Pro Tyr Thr His lie Lys Ser lie Pro Asp Ser Val lie Gin 
65 70 75 80 

TAT ATA TCG AAA CAG ATT GGT GTT AGT CCA TCC TCG CTT GAT CAT TAT 2 88 

Tyr lie Ser Lys Gin lie Gly Val Ser Pro Ser Ser Leu Asp His Tyr 

85 90 95 

CCT CAA AGG GAA AAT ACA CTT TGG GAT CAT TTG AAA GAA ATT CGA AGT 336 
Pro Gin Arg Glu Asn Thr Leu Trp Asp His Leu Lys Glu lie Arg Ser 
100 105 110 



GAA TAC GAC TTT GTA ACT TTT ACC CTG AGT GAA TAT CGA ATG ACA TTT 
Glu Tyr Asp Phe Val Thr Phe Thr Leu Ser Glu Tyr Arg Met Thr Phe 
115 120 125 



384 



AAG TAC CTT CAT CAA TTA GCT TTG GAA AAT GGT GAT GCC ATT CAT CTA 
•Lys Tyr Leu His Gin Leu Ala Leu Glu Asn Gly Asp Ala lie His Leu 
130 135 140 

CTG CAT GAA TGC ATA GAT TTT CTA AGA AAA AAC AAA ATT ATA CTG CCT 
Leu His Glu Cys lie Asp Phe Leu Arg Lys Asn Lys lie lie Leu Pro 
145 150 155 160 

GCT ATC ACT ACA CTT GAA AGA ATG GTG TGG GAA GCA AGG GCA ATG GCT 
Ala lie Thr Thr Leu Glu Arg Met Val Trp Glu Ala Arg Ala Met Ala 

165 170 175 

GAA AAG AAG CTA TTT AAT ACG GTT AGT AAA TCT CTA ACA AAT GAG CAA 
Glu Lys Lys Leu Phe Asn Thr Val Ser Lys Ser Leu Thr Asn Glu Gin 
180 185 190 

AAA GAA AAG CTT GAA GGG ATT ATT ACC TCG CAG CAT CCA TCC GAA TCC 
Lys Glu Lys Leu Glu Gly lie lie Thr Ser Gin His Pro Ser Glu Ser 
195 200 205 

AAT AAA ACG ATA TTG GGT TGG TTA AAA GAG CCA CCG GGT CAT CCT TCA 
Asn Lys Thr lie Leu Gly Trp Leu Lys Glu Pro Pro Gly His Pro Ser 
210 215 220 

CCC GAA ACT TTT CTA AAA ATA ATA GAA CGA CTC GAA TAC ATA CGA GGA 
Pro Glu Thr Phe Leu Lys lie lie Glu Arg Leu Glu Tyr lie Arg Gly 
225 230 235 240 

ATG GAT TTA GAA ACA GTG CAA ATT AGT CAT TTG CAC CGT AAC CGC CTG 
Met Asp Leu Glu Thr Val Gin lie Ser His Leu His Arg Asn Arg Leu 

245 250 255 

TTG CAG CTG TCT CGC TTA GGC TCA AGA TAC GAG CCG TAT GCA TTC CGT 
Leu Gin Leu Ser Arg Leu Gly Ser Arg Tyr Glu Pro Tyr Ala Phe Arg 
260 265 270 

GAC TTT CAA GAA AAT AAA CGT TAT TCG ATA TTA ACC ATC TAT TTA TTA 
Asp Phe Gin Glu Asn Lys Arg Tyr Ser lie Leu Thr lie Tyr Leu Leu 
275 280 285 

CAA CTT ACT CAG GAG CTA ACG GAT AAA GCG TTT GAA ATT CAT GAT AGG 
Gin Leu Thr Gin Glu Leu Thr Asp Lys Ala Phe Glu lie His Asp Arg 
290 295 300 

CAA ATA CTT AGT TTG TTA TCA AAA GGT CGT AAG GCT CAA GAG GAA ATC 
Gin lie Leu Ser Leu Leu Ser Lys Gly Arg Lys Ala Gin Glu Glu lie 
305 310 315 320 

CAG AAA CAA AAC GGT AAA AAG CTA AAT GAG AAA GTT ATA CAC TTT ACG 
Gin Lys Gin Asn Gly Lys Lys Leu Asn Glu Lys Val lie His Phe Thr 

325 330 335 



AAC ATC GGA CAA GCA TTA ATT AAA GCA AGA GAG GAA AAA TTA GAC GTT 
Asn lie Gly Gin Ala Leu lie Lys Ala Arg Glu Glu Lys Leu Asp Val 
340 345 350 



r TTT AAG GTT TTA GAA TCG GTT ATT GAA TGG AAT ACC TTT GTC TCT TCA 1104 
Phe Lys Val Leu Glu Ser Val lie Glu Trp Asn Thr Phe Val Ser Ser 
355 360 365 

GTA GAA GAG GCT CAG GAA CTT GCA CGT CCT GCC GAC TAT GAT TAT TTA 1152 
Val Glu Glu Ala Gin Glu Leu Ala Arg Pro Ala Asp Tyr Asp Tyr Leu 
370 375 380 

GAC TTA CTG CAA AAA CGG TTT TAT TCA CTA AGA AAA TAT ACG CCA ACG 12 00 

Asp Leu Leu Gin Lys Arg Phe Tyr Ser Leu Arg Lys Tyr Thr Pro Thr 
385 390 395 400 

CTA TTA AGA GTA TTG GAA TTT CAT TCT ACA AAG GCA AAT GAG CCA CTT 12 48 

Leu Leu Arg Val Leu Glu Phe His Ser Thr Lys Ala Asn Glu Pro Leu 

405 410 415 

TTA CAA GCT GTT GAG ATT ATC CGA GGA ATG AAC GAA TCT GGA AAG CGA 1296 
Leu Gin Ala Val Glu lie lie Arg Gly Met Asn Glu Ser Gly Lys Arg 
420 425 430 

AAA GTG CCT GAT GAC TCA CCT GTG GAT TTT ATT TCA AAA CGA TGG AAA 13 44 

Lys Val Pro Asp Asp Ser Pro Val Asp Phe lie Ser Lys Arg Trp Lys 
435 440 445 

AGA CAT TTA TAC GAG GAT GAT GGT ACA ACA ATT AAT CGT CAT TAC TAT 13 92 

Arg His Leu Tyr Glu Asp Asp Gly Thr Thr lie Asn Arg His Tyr Tyr 
450 455 460 

GAA ATG GCT GTT TTA ACA GAA CTT CGG GAG CAT GTT CGG GCA GGA GAT 14 4 0 

Glu Met Ala Val Leu Thr Glu Leu Arg Glu His Val Arg Ala Gly Asp 
465 470 475 480 

GTT TCC ATT GTT GGC AGC AGA CAA TAT AGG GAT TTT GAG GAA TAT TTG 14 88 

Val Ser lie Val Gly Ser Arg Gin Tyr Arg Asp Phe Glu Glu Tyr Leu 

485 490 495 

TTT TCG GAA GAT ACA TGG AAT CAA TCG AAG GGG AAT ACG AGA TTA TCA 153 6 

Phe Ser Glu Asp Thr Trp Asn Gin Ser Lys Gly Asn Thr Arg Leu Ser 
500 505 510 

GTT AGT TTA TCA TTC GAA GAT TAT ATA ACG GAG AGA ACC AGC AGC TTT 1584 
Val Ser Leu Ser Phe Glu Asp Tyr lie Thr Glu Arg Thr Ser Ser Phe 
515 520 525 

AAT GAA AGG TTA AAG TGG TTA GCT GCC AAT TCC AAT AAG TTA GAT GGG 163 2 

Asn Glu Arg Leu Lys Trp Leu Ala Ala Asn Ser Asn Lys Leu Asp Gly 
530 535 540 

GTT TCT CTT GAA AAA GGA AAG CTA TCA CTT GCA CGC TTA GAA AAA GAT 168 0 

Val Ser Leu Glu Lys Gly Lys Leu Ser Leu Ala Arg Leu Glu Lys Asp 
545 550 555 560 

GTT CCA GAA GAA GCA AAA AAA TTT AGT GCA AGC CTT TAT CAG ATG CTA 17 2 8 

Val Pro Glu Glu Ala Lys Lys Phe Ser Ala Ser Leu Tyr Gin Met Leu 

565 570 575 



CCA AGA ATA AAA TTA ACT GAT TTA CTC ATG GAT GTG GCC CAT ATA AC A 177 6 

Pro Arg lie Lys Leu Thr Asp Leu Leu Met Asp Val Ala His lie Thr 
580 585 590 

GGA TTT CAT GAG CAA TTC ACT CAT GCT TCC AAT AAT CGA AAA CCA GAT 182 4 

Gly Phe His Glu Gin Phe Thr His Ala Ser Asn Asn Arg Lys Pro Asp 
595 600 605 

AAG GAA GAA ACA ATC ATT ATC ATG GCT GCC CTT TTA GGA ATG GGA ATG 187 2 

Lys Glu Glu Thr lie lie lie Met Ala Ala Leu Leu Gly Met Gly Met 
610 615 620 

AAT ATT GGC TTG AGC AAG ATG GCC GAA GCC ACA CCC GGA CTT ACA TAT 192 0 

Asn lie Gly Leu Ser Lys Met Ala Glu Ala Thr Pro Gly Leu Thr Tyr 
625 630 635 640 

AAG CAA CTA GCC AAT GTA TCT CAA TGG CGC ATG TAT GAA GAT GCC ATG 1968 
Lys Gin Leu Ala Asn Val Ser Gin Trp Arg Met Tyr Glu Asp Ala Met 

645 650 655 

AAT AAA GCC CAA GCC ATA TTA GTA AAC TTT CAT CAT AAA TTA CAA TTG 2 016 

Asn Lys Ala Gin Ala lie Leu Val Asn Phe His His Lys Leu Gin Leu 
660 665 670 

CCT TTC TAT TGG GGC GAC GGT ACA ACA TCT TCG TCA GAT GGT ATG AGA 2 064 

Pro Phe Tyr Trp Gly Asp Gly Thr Thr Ser Ser Ser Asp Gly Met Arg 
675 680 685 

ATG CAG CTA GGT GTT TCA TCA CTA CAT GCA GAT GCA AAT CCA CAT TAT 2112 
Met Gin Leu Gly Val Ser Ser Leu His Ala Asp Ala Asn Pro His Tyr 
690 695 700 

GGA ACT GGA AAA GGA GCC ACC ATC TAC CGA TTT ACA AGT GAT CAA TTC 2160 
Gly Thr Gly Lys Gly Ala Thr lie Tyr Arg Phe Thr Ser Asp Gin Phe 
705 710 715 720 

TCT TCT TAC TAC ACA AAG ATT ATT CAT ACT AAT TCA AGA GAT GCG ATT 2 2 08 

Ser Ser Tyr Tyr Thr Lys lie lie His Thr Asn Ser Arg Asp Ala lie 

725 730 735 

CAT GTT TTG GAT GGT TTG TTA CAT CAT GAG ACG GAT CTA AAC ATA GAG 22 56 

His Val Leu Asp Gly Leu Leu His His Glu Thr Asp Leu Asn lie Glu 
740 745 750 

GAA CAT TAT ACA GAC ACT GCC GGT TAC ACT GAC CAA ATA TTC GGA CTG 2 3 04 

Glu His Tyr Thr Asp Thr Ala Gly Tyr Thr Asp Gin lie Phe Gly Leu 
755 760 765 

ACT CAT TTA TTA GGA TTT AAA TTT GCC CCA AGA ATA AGG GAT TTA TCG 2 3 52 

Thr His Leu Leu Gly Phe Lys Phe Ala Pro Arg lie Arg Asp Leu Ser 
770 775 780 

GAC TCA AAA TTA TTT ACG ATA GAT AAA GCA AGT GAG TAT CCA AAA CTA 24 00 

Asp Ser Lys Leu Phe Thr lie Asp Lys Ala Ser Glu Tyr Pro Lys Leu 
785 790 795 800 



,GAA GCC ATT TTA CGT GGA CAA ATA AAT ACA AAG GTC ATT AAA GAA AAT 
Glu Ala lie Leu Arg Gly Gin lie Asn Thr Lys Val lie Lys Glu Asn 

805 810 815 



2448 



TAT GAG GAT GTT TTG CGA TTA GCT CAT TCT ATA AGG GAG GGA ACA GTT 2 49 6 

Tyr Glu Asp Val Leu Arg Leu Ala His Ser lie Arg Glu Gly Thr Val 
820 825 830 

TCA GCA TCC CTT ATT ATG GGG AAG CTA GGT TCC TAT TCA AGA CAA AAC 2 544 

Ser Ala Ser Leu lie Met Gly Lys Leu Gly Ser Tyr Ser Arg Gin Asn 
835 840 845 

AGC TTA GCT ACA GCC TTA CGT GAG ATG GGC CGA ATA GAA AAA ACG ATC 2 592 

Ser Leu Ala Thr Ala Leu Arg Glu Met Gly Arg lie Glu Lys Thr lie 
850 855 860 

TTT ATT TTG AAT TAT ATA TCG GAT GAA TCA TTA AGA AGA AAA ATA CAA 2 64 0 

Phe lie Leu Asn Tyr lie Ser Asp Glu Ser Leu Arg Arg Lys lie Gin 
865 870 875 880 

AGA GGA TTG AAT AAA GGA GAA GCC ATG AAT GGA TTG GCA AGA GCT ATT 2 688 

Arg Gly Leu Asn Lys Gly Glu Ala Met Asn Gly Leu Ala Arg Ala lie 

885 890 895 

TTC TTC GGA AAA CAA GGT GAG CTT AGA GAA CGC ACC ATA CAG CAT CAA 273 6 

Phe Phe Gly Lys Gin Gly Glu Leu Arg Glu Arg Thr lie Gin His Gin 
900 905 910 

TTG CAA AGA GCC AGT GCT TTA AAC ATA ATT ATC AAT GCT ATA AGT ATT 27 84 

Leu Gin Arg Ala Ser Ala Leu Asn lie lie lie Asn Ala lie Ser lie 
915 920 925 

TGG AAT ACT CTC CAC CTA ACA ACA GCA GTT GAA TAT AAA AAA CGG ACA 28 32 

Trp Asn Thr Leu His Leu Thr Thr Ala Val Glu Tyr Lys Lys Arg Thr 
930 935 940 

GGT AGC TTT AAT GAA GAT TTG TTA CAC CAT ATG TCG CCC TTA GGT TGG 28 8 0 

Gly Ser Phe Asn Glu Asp Leu Leu His His Met Ser Pro Leu Gly Trp 
945 950 955 960 

GAA CAT ATT AAT TTA CTA GGA GAA TAC CAT TTT AAC TCA GAG AAA GTA 2928 
Glu His lie Asn Leu Leu Gly Glu Tyr His Phe Asn Ser Glu Lys Val 

965 970 975 

GTC TCA TTA AAT TCT TTA AGA CCA CTA AAA CTT TCT 29 64 

Val Ser Leu Asn Ser Leu Arg Pro Leu Lys Leu Ser 
980 985 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 988 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 

Met Lys lie Ala Arg Gly Arg Glu Leu Leu Thr Pro Glu Gin Arg Gin 
15 10 15 

Ala Phe Met. Gin lie Pro Glu Asp Glu Trp lie Leu Gly Thr Tyr Phe 
20 25 30 

Thr Phe Ser Lys Arg Asp Leu Glu lie Val Asn Lys Arg Arg Arg Glu 
35 40 45 

Glu Asn Arg Leu Gly Phe Ala Val Gin Leu Ala Val Leu Arg Tyr Pro 
50 55 60 

Gly Trp Pro Tyr Thr His lie Lys Ser lie Pro Asp Ser Val lie Gin 
65 70 75 80 

Tyr lie Ser Lys Gin lie Gly Val Ser Pro Ser Ser Leu Asp His Tyr 

85 90 95 

Pro Gin Arg Glu Asn Thr Leu Trp Asp His Leu Lys Glu lie Arg Ser 
100 105 110 

Glu Tyr Asp Phe Val Thr Phe Thr Leu Ser Glu Tyr Arg Met Thr Phe 
115 120 125 

Lys Tyr Leu His Gin Leu Ala Leu Glu Asn Gly Asp Ala lie His Leu 
130 135 140 

Leu His Glu Cys lie Asp Phe Leu Arg Lys Asn Lys lie lie Leu Pro 
145 150 155 160 

Ala lie Thr Thr Leu Glu Arg Met Val Trp Glu Ala Arg Ala Met Ala 

165 170 175 

Glu Lys Lys Leu Phe Asn Thr Val Ser Lys Ser Leu Thr Asn Glu Gin 
180 185 190 

Lys Glu Lys Leu Glu Gly lie lie Thr Ser Gin His Pro Ser Glu Ser 
195 200 205 

Asn Lys Thr lie Leu Gly Trp Leu Lys Glu Pro Pro Gly His Pro Ser 
210 215 220 

Pro Glu Thr Phe Leu Lys lie lie Glu Arg Leu Glu Tyr lie Arg Gly 
225 230 235 240 

Met Asp Leu Glu Thr Val Gin lie Ser His Leu His Arg Asn Arg Leu 

245 250 255 

Leu Gin Leu Ser Arg Leu Gly Ser Arg Tyr Glu Pro Tyr Ala Phe Arg 
260 265 270 

Asp Phe Gin Glu Asn Lys Arg Tyr Ser lie Leu Thr lie Tyr Leu Leu 
275 280 285 

Gin Leu Thr Gin Glu Leu Thr Asp Lys Ala Phe Glu lie His Asp Arg 
290 295 300 



Gin lie Leu Ser Leu Leu Ser Lys Gly Arg Lys Ala Gin Glu Glu lie 
'305 310 315 320 

Gin Lys Gin Asn Gly Lys Lys Leu Asn Glu Lys Val lie His Phe Thr 

325 330 335 

Asn lie Gly Gin Ala Leu lie Lys Ala Arg Glu Glu Lys Leu Asp Val 
340 345 350 

Phe Lys Val Leu Glu Ser Val lie Glu Trp Asn Thr Phe Val Ser Ser 
355 360 365 

Val Glu Glu Ala Gin Glu Leu Ala Arg Pro Ala Asp Tyr Asp Tyr Leu 
370 375 380 

Asp Leu Leu Gin Lys Arg Phe Tyr Ser Leu Arg Lys Tyr Thr Pro Thr 
385 390 395 400 

Leu Leu Arg Val Leu Glu Phe His Ser Thr Lys Ala Asn Glu Pro Leu 

405 410 415 

Leu Gin Ala Val Glu lie lie Arg Gly Met Asn Glu Ser Gly Lys Arg 
420 425 430 

Lys Val Pro Asp Asp Ser Pro Val Asp Phe lie Ser Lys Arg Trp Lys 
435 440 445 

Arg His Leu Tyr Glu Asp Asp Gly Thr Thr lie Asn Arg His Tyr Tyr 
450 455 460 

Glu Met Ala Val Leu Thr Glu Leu Arg Glu His Val Arg Ala Gly Asp 
465 470 475 480 

Val Ser lie Val Gly Ser Arg Gin Tyr Arg Asp Phe Glu Glu Tyr Leu 

485 490 495 

Phe Ser Glu Asp Thr Trp Asn Gin Ser Lys Gly Asn Thr Arg Leu Ser 
500 505 510 

Val Ser Leu Ser Phe Glu Asp Tyr lie Thr Glu Arg Thr Ser Ser Phe 
515 520 525 

Asn Glu Arg Leu Lys Trp Leu Ala Ala Asn Ser Asn Lys Leu Asp Gly 
530 535 540 

Val Ser Leu Glu Lys Gly Lys Leu Ser Leu Ala Arg Leu Glu Lys Asp 
545 550 555 560 

Val Pro Glu Glu Ala Lys Lys Phe Ser Ala Ser Leu Tyr Gin Met Leu 

565 570 575 

Pro Arg lie Lys Leu Thr Asp Leu Leu Met Asp Val Ala His lie Thr 
580 585 590 

Gly Phe His Glu Gin Phe Thr His Ala Ser Asn Asn Arg Lys Pro Asp 
595 600 605 



Lys Glu Glu Thr lie lie lie Met Ala Ala Leu Leu Gly Met Gly Met 
610 615 620 

Asn lie Gly Leu Ser Lys Met Ala Glu Ala Thr Pro Gly Leu Thr Tyr 
625 630 635 640 

Lys Gin Leu Ala Asn Val Ser Gin Trp Arg Met Tyr Glu Asp Ala Met 

645 650 655 

Asn Lys Ala Gin Ala lie Leu Val Asn Phe His His Lys Leu Gin Leu 
660 665 670 

Pro Phe Tyr Trp Gly Asp Gly Thr Thr Ser Ser Ser Asp Gly Met Arg 
675 680 685 

Met Gin Leu Gly Val Ser Ser Leu His Ala Asp Ala Asn Pro His Tyr 
690 695 700 

Gly Thr Gly Lys Gly Ala Thr lie Tyr Arg Phe Thr Ser Asp Gin Phe 
705 710 715 720 

Ser Ser Tyr Tyr Thr Lys lie lie His Thr Asn Ser Arg Asp Ala lie 

725 730 735 

His Val Leu Asp Gly Leu Leu His His Glu Thr Asp Leu Asn lie Glu 
740 745 750 

Glu His Tyr Thr Asp Thr Ala Gly Tyr Thr Asp Gin lie Phe Gly Leu 
755 760 765 

Thr His Leu Leu Gly Phe Lys Phe Ala Pro Arg lie Arg Asp Leu Ser 
770 775 780 

Asp Ser Lys Leu Phe Thr lie Asp, Lys Ala Ser Glu Tyr Pro Lys Leu 
785 790 795 800 

Glu Ala lie Leu Arg Gly Gin lie Asn Thr Lys Val lie Lys Glu Asn 

805 810 815 

Tyr Glu Asp Val Leu Arg Leu Ala His Ser lie Arg Glu Gly Thr Val 
820 825 830 

Ser Ala Ser Leu lie Met Gly Lys Leu Gly Ser Tyr Ser Arg Gin Asn 
835 840 845 

Ser Leu Ala Thr Ala Leu Arg Glu Met Gly Arg lie Glu Lys Thr lie 
850 855 860 

Phe lie Leu Asn Tyr lie Ser Asp Glu Ser Leu Arg Arg Lys lie Gin 
865 870 875 880 

Arg Gly Leu Asn Lys Gly Glu Ala Met Asn Gly Leu Ala Arg Ala lie 

885 .890 895 

Phe Phe Gly Lys Gin Gly Glu Leu Arg Glu Arg Thr lie Gin His Gin 
900 905 910 



ieu Gin Arg Ala Ser Ala Leu Asn lie He He Asn Ala He Ser lie 
915 920 925 



Trp Asn Thr Leu His Leu Thr Thr Ala Val Glu Tyr Lys Lys Arg Thr 
930 935 940 

Gly Ser Phe Asn Glu Asp Leu Leu His His Met Ser Pro Leu Gly Trp 
945 950 955 960 

Glu His He Asn Leu Leu Gly Glu Tyr His Phe Asn Ser Glu Lys Val 

965 970 975 

Val Ser Leu Asn Ser Leu Arg Pro Leu Lys Leu Ser 
980 985 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 573 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..573 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TTG CGG AAA ATC GGT TAT ATT CGT GTC AGT TCG ACT AAC CAG AAT CCT 
Leu Arg Lys He Gly Tyr He Arg Val Ser Ser Thr Asn Gin Asn Pro 
1 5 10 15 

TCA AGA CAA TTT CAG CAG TTG AAC GAG ATC GGA ATG GAT ATT ATA TAT 
Ser Arg Gin Phe Gin Gin Leu Asn Glu He Gly Met Asp He He Tyr 
20 25 30 

GAA GAG AAA GTT TCA GGA GCA ACA AAG GAT CGC GAG CAA CTT CAA AAA 
Glu Glu Lys Val Ser Gly Ala Thr Lys Asp Arg Glu Gin Leu Gin Lys 
35 40 45 

GTG TTA GAC GAT TTA CAG GAA GAT GAC ATC ATT TAT GTT ACA GAC TTA 
Val Leu Asp Asp Leu Gin Glu Asp Asp lie He Tyr Val Thr Asp Leu 
50 55 60 

ACT CGA ATC ACT CGT AGT ACA CAA GAT CTA TTT GAA TTA ATC GAT AAC 
Thr Arg He Thr Arg Ser Thr Gin Asp Leu Phe Glu Leu He Asp Asn 
65 70 75 80 



ATA CGA GAT AAA AAG GCA AGT TTA AAA TCA CTA AAA GAT ACA TGG CTT 
He Arg Asp Lys Lys Ala Ser Leu Lys Ser Leu Lys Asp Thr Trp Leu 

85 90 95 



.GAT TTA TCA GAA GAT AAT CCA TAC AGC CAA TTC TTA ATT ACT GTA ATG 
Asp Leu Ser Glu Asp Asn Pro Tyr Ser Gin Phe Leu lie Thr Val Met 
100 105 110 



336 



GCT GGT GTT AAC CAA TTA GAG CGA GAT CTT ATT CGG ATG AGA CAA CGT 3 84 

Ala Gly Val Asn Gin Leu Glu Arg Asp Leu lie Arg Met Arg Gin Arg 
115 120 125 

GAA GGG ATT GAA TTG GCT AAG AAA GAA GGA AAG TTT AAA GGT CGA TTA 43 2 

Glu Gly lie Glu Leu Ala Lys Lys Glu Gly Lys Phe Lys Gly Arg Leu 
130 135 140 

AAG AAG TAT CAT AAA AAT CAC GCA GGA ATG AAT TAT GCG GTA AAG CTA 48 0 

Lys Lys Tyr His Lys Asn His Ala Gly Met Asn Tyr Ala Val Lys Leu 
145 150 155 160 

TAT AAA GAA GGA AAT ATG ACT GTA AAT CAA ATT TGT GAA ATT ACT AAT 52 8 

Tyr Lys Glu Gly Asn Met Thr Val Asn Gin lie Cys Glu lie Thr Asn 

165 170 175 

GTA TCT AGG GCT TCA TTA TAC AGG AAA TTA TCA GAA GTG AAT AAT 57 3 

Val Ser Arg Ala Ser Leu Tyr Arg Lys Leu Ser Glu Val Asn Asn 
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Leu Arg Lys lie Gly Tyr lie Arg Val Ser Ser Thr Asn Gin Asn Pro 
15 10 15 

Ser Arg Gin Phe Gin Gin Leu Asn Glu lie Gly Met Asp lie lie Tyr 
20 25 30 

Glu Glu Lys Val Ser Gly Ala Thr Lys Asp Arg Glu Gin Leu Gin Lys 
35 40 45 

Val Leu Asp Asp Leu Gin Glu Asp Asp lie lie Tyr Val Thr Asp Leu 
50 55 60 

Thr Arg lie Thr Arg Ser Thr Gin Asp Leu Phe Glu Leu lie Asp Asn 
65 70 75 80 

lie Arg Asp Lys Lys Ala Ser Leu Lys Ser Leu Lys Asp Thr Trp Leu 

85 90 95 

Asp Leu Ser Glu Asp Asn Pro Tyr Ser Gin Phe Leu lie Thr Val Met 
100 105 110 



Ala Gly Val Asn 
115 

Glu Gly lie Glu 
130 

Lys Lys Tyr His 
145 

Tyr Lys Glu Gly 



Val Ser Arg Ala 
180 



Gin Leu Glu Arg 
120 

Leu Ala Lys Lys 
135 

Lys Asn His Ala 
150 

Asn Met Thr Val 
165 

Ser Leu Tyr Arg 



Asp Leu lie Arg 



Glu Gly Lys Phe 
140 

Gly Met Asn Tyr 
155 

Asn Gin lie Cys 
170 

Lys Leu Ser Glu 
185 



Met Arg Gin Arg 
125 

Lys Gly Arg Leu . 



Ala Val Lys Leu 
160 

Glu lie Thr Asn 
175 

Val Asn Asn 
190 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..909 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATG AAG AAG TTG TTT TTT TTA TTG TTA TTG TTA TTC TTA ATA TAC TTA 
Met Lys Lys Leu Phe Phe Leu Leu Leu Leu Leu Phe Leu lie Tyr Leu 
15 10 15 

GGT TAT GAC TAC GTT AAT GAA GCA CTG TTT TCT CAG GAA AAA GTC GAA 
Gly Tyr Asp Tyr Val Asn Glu Ala Leu Phe Ser Gin Glu Lys Val Glu 
20 25 30 

TTT CAA AAT TAT GAT CAA AAT CCC AAA GAA CAT TTA GAA AAT AGT GGG 
Phe Gin Asn Tyr Asp Gin Asn Pro Lys Glu His Leu Glu Asn Ser Gly 
35 40 45 

ACT TCT GAA AAT ACC CAA GAG AAA ACA ATT ACA GAA GAA CAG GTT TAT 
Thr Ser Glu Asn Thr Gin Glu Lys Thr lie Thr Glu Glu Gin Val Tyr 
50 55 60 

CAA GGA AAT CTG CTA TTA ATC AAT AGT AAA TAT CCT GTT CGC CAA GAA 
Gin Gly Asn Leu Leu Leu lie Asn Ser Lys Tyr Pro Val Arg Gin Glu 
65 70 75 80 

AGT GTG AAG TCA GAT ATC GTG AAT TTA TCT AAA CAT GAC GAA TTA ATA 
Ser Val Lys Ser Asp lie Val Asn Leu Ser Lys His Asp Glu Leu lie 

85 90 95 



AAT GGA TAC GGG TTG CTT GAT AGT AAT ATT TAT ATG TCA AAA GAA ATA 
-Asn Gly Tyr Gly Leu Leu Asp Ser Asn lie Tyr Met Ser Lys Glu lie 
100 105 110 



GCA CAA AAA TTT TCA GAG ATG GTC AAT GAT GCT GTA AAG GGT GGC GTT 
Ala Gin Lys Phe Ser Glu Met Val Asn Asp Ala Val Lys Gly Gly Val 
115 120 125 

AGT CAT TTT ATT ATT AAT AGT GGC TAT CGA GAC TTT GAT GAG CAA AGT 
Ser His Phe lie lie Asn Ser Gly Tyr Arg Asp Phe Asp Glu Gin Ser 
130 135 140 

GTG CTT TAC CAA GAA ATG GGG GCT GAG TAT GCC TTA CCA GCA GGT TAT 
Val Leu Tyr Gin Glu Met Gly Ala Glu Tyr Ala Leu Pro Ala Gly Tyr 
145 150 155 160 

AGT GAG CAT AAT TCA GGT TTA TCA CTA GAT GTA GGA TCA AGC TTG ACG 
Ser Glu His Asn Ser Gly Leu Ser Leu Asp Val Gly Ser Ser Leu Thr 

165 170 175 

AAA ATG GAA CGA GCC CCT GAA GGA AAG TGG ATA GAA GAA AAT GCT TGG 
Lys Met Glu Arg Ala Pro Glu Gly Lys Trp lie Glu Glu Asn Ala Trp 
180 185 190 

AAA TAC GGG TTC ATT TTA CGT TAT CCA GAG GAC AAA ACA GAG TTA ACA 
Lys Tyr Gly Phe lie Leu Arg Tyr Pro Glu Asp Lys Thr Glu Leu Thr 
195 200 205 

GGA ATT CAA TAT GAA CCA TGG CAT ATT CGC TAT GTT GGT TTA CCA CAT 
Gly lie Gin Tyr Glu Pro Trp His lie Arg Tyr Val Gly Leu Pro His 
210 215 220 

AGT GCG ATT ATG AAA GAA AAG AAT TTC GTT CTC GAG GAA TAT ATG GAT 
Ser Ala lie Met Lys Glu Lys Asn Phe Val Leu Glu Glu Tyr Met Asp 
225 230 235 240 

TAC CTA AAA GAA GAA AAA ACC ATT TCT GTT AGT GTA AAT GGG GAA AAA 
Tyr Leu Lys Glu Glu Lys Thr lie Ser Val Ser Val Asn Gly Glu Lys 

245 250 255 

TAT GAG ATC TTT TAT TAT CCT GTT ACT AAA AAT ACC ACC ATT CAT GTG 
Tyr Glu lie Phe Tyr Tyr Pro Val Thr Lys Asn Thr Thr lie His Val 
260 265 270 

CCG ACT AAT CTT CGT TAT GAG ATA TCA GGA AAC AAT ATA GAC GGT GTA 
Pro Thr Asn Leu Arg Tyr Glu lie Ser Gly Asn Asn lie Asp Gly Val 
275 280 285 

ATT GTG ACA GTG TTT CCC GGA TCA ACA CAT ACT AAT TCA AGG AGG 
lie Val Thr Val Phe Pro Gly Ser Thr His Thr Asn Ser Arg Arg 
290 295 300 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 03 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Lys Lys Leu Phe Phe Leu Leu Leu Leu Leu Phe Leu lie Tyr Leu 
15 10 15 

Gly Tyr Asp Tyr Val Asn Glu Ala Leu Phe Ser Gin Glu Lys Val Glu 
20 25 30 

Phe Gin Asn Tyr Asp Gin Asn Pro Lys Glu His Leu Glu Asn Ser Gly 
35 40 45 

Thr Ser Glu Asn Thr Gin Glu Lys Thr lie Thr Glu Glu Gin Val Tyr 
50 55 60 

Gin Gly Asn Leu Leu Leu lie Asn Ser Lys Tyr Pro Val Arg Gin Glu 
65 70 75 80 

Ser Val Lys Ser Asp lie Val Asn Leu Ser Lys His Asp Glu Leu lie 

85 90 95 

Asn Gly Tyr Gly Leu Leu Asp Ser Asn lie Tyr Met Ser Lys Glu lie 
100 105 110 

Ala Gin Lys Phe Ser Glu Met Val Asn Asp Ala Val Lys Gly Gly Val 
115 120- 125 

Ser His Phe lie lie Asn Ser Gly Tyr Arg Asp Phe Asp Glu Gin Ser 
130 135 140 

Val Leu Tyr Gin Glu Met Gly Ala Glu Tyr Ala Leu Pro Ala Gly Tyr 
145 150 155 160 

Ser Glu His Asn Ser Gly Leu Ser Leu Asp Val Gly Ser Ser Leu Thr 

165 170 175 

Lys Met Glu Arg Ala Pro Glu Gly Lys Trp lie Glu Glu Asn Ala Trp 
180 185 190 

Lys Tyr Gly Phe lie Leu Arg Tyr Pro Glu Asp Lys Thr Glu Leu Thr 
195 200 205 

Gly lie Gin Tyr Glu Pro Trp His lie Arg Tyr Val Gly Leu Pro His 
210 215 220 

Ser Ala lie Met Lys Glu Lys Asn Phe Val Leu Glu Glu Tyr Met Asp 
225 230 235 240 

Tyr Leu Lys Glu Glu Lys Thr lie Ser Val Ser Val Asn Gly Glu Lys 

245 250 255 



Tyr Glu lie Phe Tyr Tyr Pro Val Thr Lys Asn Thr Thr lie His Val 
260 265 270 

Pro Thr Asn Leu Arg Tyr Glu lie Ser Gly Asn Asn lie Asp Gly Val 
275 280 285 

lie Val Thr Val Phe Pro Gly Ser Thr His Thr Asn Ser Arg Arg 
290 295 300 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..483 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



TTG GGA AAA ATA TTA TCT AGA GGA TTG CTA GCT TTA TAT TTA GTG ACA 
Leu Gly Lys lie Leu Ser Arg Gly Leu Leu Ala Leu Tyr Leu Val Thr 
1 5 10 15 



48 



CTA ATC TGG TTA GTG TTA TTC AAA TTA CAA TAC AAT ATT TTA TCA GTA 
Leu lie Trp Leu Val Leu Phe Lys Leu Gin Tyr Asn lie Leu Ser Val 
20 25 30 



96 



TTT AAT TAT CAT CAA AGA AGT CTT AAC TTG ACT CCA TTT ACT GCT ACT 
Phe Asn Tyr His Gin Arg Ser Leu Asn Leu Thr Pro Phe Thr Ala Thr 
35 40 45 



144 



GGG AAT TTC AGA GAG ATG ATA GAT AAT GTT ATA ATC TTT ATT CCA TTT 
Gly Asn Phe Arg Glu Met lie Asp Asn Val lie lie Phe lie Pro Phe 
50 55 60 



192 



GGC TTG CTT TTG AAT GTC AAT TTT AAA GAA ATC GGA TTT TTA CCT AAG 2 40 

Gly Leu Leu Leu Asn Val Asn Phe Lys Glu lie Gly Phe Leu Pro Lys 
65 70 75 80 



TTT GCT TTT GTA CTG GTT TTA AGT CTT ACT TTT GAA ATA ATT CAA TTT 288 
Phe Ala Phe Val Leu Val Leu Ser Leu Thr Phe Glu lie lie Gin Phe 

85 90 95 

ATC TTC GCT ATT GGA GCG ACA GAC ATA ACA GAT GTA ATT ACA AAT ACT 3 36 

lie Phe Ala lie Gly Ala Thr Asp lie Thr Asp Val lie Thr Asn Thr 
100 105 110 



GTT GGA GGC TTT CTT GGA CTG AAA TTA TAT GGT TTA AGC AAT AAG CAT 
Val Gly Gly Phe Leu Gly Leu Lys Leu Tyr Gly Leu Ser Asn Lys His 
115 120 125 



384 



ATG AAT CAA AAA AAA TTA GAC AGA GTT ATT ATT TTT GTA GGT ATA CTT 43 2 

Met Asn Gin Lys Lys Leu Asp Arg Val lie lie Phe Val Gly lie Leu 
130 135 140 

TTG CTC GTA TTA TTG CTC GTT TAC CGT ACC CAT TTA AGA ATA AAT TAC 48 0 

Leu Leu Val Leu Leu Leu Val Tyr Arg Thr His Leu Arg lie Asn Tyr 
145 150 155 160 

GTG 483 
Val 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Leu Gly Lys lie Leu Ser Arg Gly Leu Leu Ala Leu Tyr Leu Val Thr 
1 5 10 15 

Leu lie Trp Leu Val Leu Phe Lys Leu Gin Tyr Asn lie Leu Ser Val 
20 25 30 

Phe Asn Tyr His Gin Arg Ser Leu Asn Leu Thr Pro Phe Thr Ala Thr 
35 40 45 

Gly Asn Phe Arg Glu Met lie Asp Asn Val lie lie Phe lie Pro Phe 
50 55 60 

Gly Leu Leu Leu Asn Val Asn Phe Lys Glu lie Gly Phe Leu Pro Lys 
65 70 75 80 

Phe Ala Phe Val Leu Val Leu Ser Leu Thr Phe Glu lie lie Gin Phe 

85 90 95 

lie Phe Ala lie Gly Ala Thr Asp lie Thr Asp Val lie Thr Asn Thr 
100 105 110 

Val Gly Gly Phe Leu Gly Leu Lys Leu Tyr Gly Leu Ser Asn Lys His 
115 120 125 

Met Asn Gin Lys Lys Leu Asp Arg Val lie lie Phe Val Gly lie Leu 
130 135 140 

Leu Leu Val Leu Leu Leu Val Tyr Arg Thr His Leu Arg lie Asn Tyr 
145 150 155 160 



Val 



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
ATGCTCCTGT CTCCTTTC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2296 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRII 

Lys Leu Phe Phe Leu 
1 5 

Leu Phe Ser His Cys 
20 

Asp Ser Arg Gin Tyr 
35 

Gin Met Asn Gly lie 
50 

Cys His Ser lie Phe 
65 

Ser Asn Phe Phe Gin 

85 

Met lie Lys Arg Gly 
100 

Tyr Asp Arg lie Trp 
115 

lie Pro Lys Asn Ser 
130 

Ser Leu lie Asn Tyr 
145 



TION: SEQ ID NO: 27: 

Leu lie Cys Arg Phe Thr 

10 

Pro Cys Phe Pro His His 
25 

Asn Phe Val Phe Ser Lys 
40 

Thr lie Phe Gin Ser Leu 
55 

Thr Gin Gly Lys Ser Tyr 
70 75 

Met lie Pro Lys Cys lie 

90 

Trp Thr Asn Thr Asn Leu 
105 

Asp Ala Phe Asp Met Ser 
120 

Leu Asn Ser Lys Ser Thr 
135 

Phe lie Pro Phe Gly Lys 
150 155 



Asn Arg lie Lys Leu 
15 

Ser Phe Lys Cys Ser 
30 

lie Tyr Ala Phe Met 
45 

Met Lys Val Leu Lys 
60 

Lys Val Val Phe Thr 

80 

Phe Pro Leu Arg lie 
95 

Phe Arg Tyr lie Leu 
110 

Val Trp Pro Thr Gly 
125 

Val Phe Phe Pro Pro 
140 

Ser Glu Val Gly Pro 

160 



Gin Tyr Pro Phe lie Phe Arg Asp Leu His Lys Ser Leu Ser Leu Phe 

165 170 175 

Arg Cys Lys Gin Phe Ser Thr Ser Arg Asn Phe His Ser Val Ser Phe 
180 185 190 

His Phe Cys lie Phe Asn Leu Leu Val Gin Leu Tyr lie Asn Arg Val 
195 200 205 

Tyr Ser lie Asp Thr Asn Val Val Asp Asn His Ser Glu Arg Leu lie 
210 215 220 

Arg Leu Val Ser Lys Met Arg Tyr Phe Ala Glu Asn Arg Leu Tyr Ser 
225 230 235 240 

Cys Gin Phe Asp Pro Glu Ser Phe Lys Thr lie Ser Ala Val Glu Arg 

245 250 255 

Asp Arg Asn Gly Tyr Tyr lie Lys Arg Lys Phe Gin Glu Gin Gin Arg 
260 265 270 

lie Ala Ser Asn Phe Lys Lys Cys Thr lie Tyr Arg Lys Met Thr Ser 
275 280 285 

Phe Met Leu Gin Thr Leu Glu Ser Leu Val Val His Lys lie Tyr Leu 
290 295 300 

Asn Ser lie Thr Tyr Glu lie Lys Arg Gin Val Asn His Lys lie His 
305 310 315 320 

Gly Leu lie Tyr Gin Lys lie lie His Thr Ala Asn Ser Leu Leu Trp 

325 330 335 

Leu Val Leu Thr Asn Ser Glu lie Leu Phe Gly Asp Asn Val Lys Gly 
340 345 350 

Leu Asn Trp Leu Arg Lys Lys Glu Ser Leu Lys Val Asp Arg Ser lie 
355 360 365 

lie Lys lie Thr Gin Glu lie Met Arg Arg Lys Leu Tyr Lys Glu Gly 
370 375 380 

Asn Met Thr Val Asn Gin lie Cys Glu lie Thr Asn Val Ser Arg Ala 
385 390 395 400 

Ser Leu Tyr Arg Lys Leu Ser Glu Val Asn Asn Pro Phe Cys lie Pro 

405 410 415 

Leu Met Gly Asn lie Phe Lys Glu Glu Lys Glu Thr lie Lys Tyr Gin 
420 425 430 

Pro Pro Ser Asp Ala Glu Lys Pro Phe Asp Lys Lys Arg lie lie lie 
435 440 445 

Leu Arg Asn Ser Ser Phe lie Met Met Leu lie Asn Ser Ala Leu Ser 
450 455 460 



Asp Lys Leu Leu Arg Ala Asn Leu Cys Glu Arg Val lie Thr Met Ser 
465 470 475 480 

Asp Lys lie Leu lie Val Asp Asp Glu His Glu lie Ala Asp Leu Val 

485 490 495 

Glu Leu Tyr Leu Lys Asn Glu Asn Tyr Thr Val Phe Lys Tyr Tyr Thr 
500 505 510 

Ala Lys Glu Ala Leu Glu Cys lie Asp Lys Ser Glu lie Asp Leu Ala 
515 520 525 

lie Leu Asp lie Met Leu Pro Gly Thr Ser Gly Leu Thr lie Cys Gin 
530 535 540 

Lys lie Arg Asp Lys His Thr Tyr Pro lie lie Met Leu Thr Gly Lys 
545 550 555 560 

Asp Thr Glu Val Asp Lys lie Thr Gly Leu Thr lie Gly Ala Asp Asp 

565 570 575 

Tyr lie Thr Lys Pro Phe Arg Pro Leu Glu Leu lie Ala Arg Val Lys 
580 585 590 

Ala Gin Leu Arg Arg Tyr Lys Lys Phe Ser Gly Val Lys Glu Gin Asn 
595 600 605 

Glu Asn Val lie Val His Ser Gly Leu Val lie Asn Val Asn Thr His 
610 615 620 

Glu Cys Tyr Leu Asn Glu Lys Gin Leu Ser Leu Thr Pro Thr Glu Phe 
625 630 635 640 

Ser lie Leu Arg lie Leu Cys Glu Asn Lys Gly Asn Val Val Ser Ser 

645 650 655 

Glu Leu Leu Phe His Glu lie Trp Gly Asp Glu Tyr Phe Ser Lys Ser 
660 665 670 

Asn Asn Thr lie Thr Val His lie Arg His Leu Arg Glu Lys Met Asn 
675 680 685 

Asp Thr lie Asp Asn Pro Lys Tyr lie Lys Thr Val Trp Gly Val Gly 
690 695 700 

Tyr Lys lie Glu Lys Lys Lys Arg Leu Phe Gin Thr Arg Thr Lys Thr 
705 710 715 720 

Leu His Val Tyr Arg Cys Asn Cys Cys Gly Ser Asn Cys lie Arg Val 

725 730 735 

Val Tyr Ser Phe Asn Asp Pro Arg Glu Thr Trp Gly Leu Asp Leu Lys 
740 745 750 

Tyr Phe Gly Lys Gin lie Leu Lys Ser Pro Gly Arg Asp Glu lie lie 
755 760 765 



Ser lie Phe His Thr Glu Gin Tyr Arg Tyr Leu Tyr Leu Cys Gly Asp 
770 775 780 

Cys His Tyr Ser Tyr Ser Met Ser Arg His Ala Phe Lys lie Arg Lys 
785 790 795 800 

lie Leu Arg Asp Lys Tyr Arg His Cys Thr Tyr Ser Glu Arg Arg Thr 

805 810 815 

Asn Ala Phe Cys Gly Asn Gly Cys Tyr Gly Thr Lys Ala Gin His lie 
820 825 830 

Lys Thr Asp Ser Gly Lys Ala Arg Ala Gly Cys Lys Ala Gly Arg Thr 
835 840 845 

Lys Lys Lys Arg Cys Tyr Val Leu Gly Ala Arg Tyr Asn Ala Pro Tyr 
850 855 860 

lie His Tyr Arg Leu Phe Glu Pro Ala Arg Gly Ser Arg His Ala Gly 
865 870 875 880 

Arg Ser Lys Gly Lys Val Cys Ala Tyr His Val Gly Gin Ser Val Ser 

885 890 895 

Thr Arg Thr Ala Asn Arg Arg Val Phe Asp Tyr Thr Val Pro Thr Asn 
900 905 910 

Asp Asn Ala Asn Lys Asn Ala His Arg Pro lie Leu Tyr Ala Gly Ala 
915 920 925 

Asp Asp Arg lie Leu Ser Ser Ala Phe Arg Thr Trp Lys Thr Gly Gly 
930 935 940 

Tyr Ser Arg Pro Arg Gly Ser Asp Arg Val Arg Arg Pro Thr Arg Glu 
945 950 955 960 

Ser Leu Gin His Phe Glu Lys Arg Arg Cys lie Gin Gly Gin His His 

965 970 975 

His Tyr Arg Gly Pro Leu Arg Gly Cys Gly Val Asn Arg lie Gin Glu 
980 985 990 

His Trp Lys His Pro Lys Arg Ala Ser Cys His lie Lys Val Leu Ala 
995 1000 1005 

Gly Gin Phe Ser Phe Phe Arg Tyr Gly Trp Arg Gly Thr Trp lie Gly 
1010 1015 1020 

Asp Cys Lys Arg Asn Tyr Cys Ser Ala Trp Arg Ala Asp Leu Arg Gly 
1025 1030 1035 1040 

Lys Leu Leu Tyr Asp Val Gly Arg Ala Ser Ser Asp Ala Arg Leu Gly 

1045 1050 1055 

Lys Glu Val Leu Arg Asp Val Tyr Asn Phe Leu Gly Lys Ser Gin Gly 
1060 1065 1070 



Tyr Leu Tyr Phe Phe Leu Gly Asn Gin Phe Asn lie Lys Lys Arg Leu 
1075 1080 1085 

Val Leu Thr Arg Thr Tyr Arg Lys Asn Glu Pro Phe Ser Phe Phe Arg 
1090 1095 1100 

Glu Arg Phe Asp Lys lie Thr lie Gly lie Pro Val Leu Phe Gly Ala 
1105 1110 1115 1120 

Phe His Arg Lys Gly Trp Ser Leu lie Thr Ser Ala Leu Leu Phe Met 

1125 1130 1135 

Asp Val Ser Arg Met Arg Gin Met His Ser Met Leu Phe Arg Leu Ala 
1140 1145 1150 

Leu Ala Leu Trp Gin Arg Leu Thr Pro Thr Cys Arg Asn Pro Thr Pro 
1155 1160 1165 

Asn Pro Arg Leu Ser lie Asn Val Ser Val Trp Asp lie Asn Gin Arg 
1170 1175 1180 

Phe Pro Pro Leu Phe Phe Leu Arg Arg Glu Pro Val Asn lie Phe Leu 
1185 1190 1195 1200 

Pro Glu Ala Ser Ala Ala lie lie lie Gin Leu Leu Leu Arg Glu Trp 

1205 1210 1215 

Ala Ser Leu Ser Thr Met Trp Arg Thr Arg Arg lie Ala Leu Pro lie 
1220 1225 1230 

lie Leu Cys Phe Leu Trp Gin Tyr Ala Thr Asn Arg Leu Cys Ala Leu 
1235 1240 1245 

Trp Lys Asn Met lie Ser Gly Trp Thr Ala Thr Val Ala Arg Tyr Ser 
1250 1255 1260 

Ala Thr Gin Leu Val Trp Trp Glu Arg Ala Arg Ala Lys Arg Leu Leu 
1265 1270 1275 1280 

Ser Gly Cys Glu Asp Leu Asp Val Lys Cys Trp Leu lie Val Ala Ala 

1285 1290 1295 

Glu Val Arg Thr Met Tyr Arg Leu Met Ser Cys Cys Lys lie Ala lie 
1300 1305 1310 

Ser Leu Arg Phe Met Cys Arg Ser lie Arg lie Arg Thr lie Leu Ser 
1315 1320 1325 

Ala Thr Asn Lys Tyr Arg Glu Ser Lys Glu His Phe Leu Ser lie Leu 
1330 1335 1340 

Gly Ala Val His Leu lie Pro Met Ser Trp Leu Lys His Lys Thr Gly 
1345 1350 1355 1360 

Asn Trp Ala Val Pro His Trp Met Tyr Trp Lys Glu Arg Lys Ser Phe 

1365 1370 1375 



Ser Thr Leu lie Ala Pro Lys Asn Gin Leu lie lie Asn Phe Tyr Leu 
1380 1385 1390 

Asn Phe Lys Glu Cys Leu Thr Ser His Arg lie Arg Pro lie lie Pro 
1395 1400 1405 

Ser Lys Arg Cys Val lie Pro Leu Lys Lys Pro Leu Lys Thr Val Trp 
1410 1415 1420 

lie Leu Lys Gly Asp. Arg Ser Met Asn Arg lie Lys Val Ala lie Leu 
1425 1430 1435 1440 

Phe Gly Gly Cys Ser Glu Glu His Asp Val Ser Val Lys Ser Ala lie 

1445 1450 1455 

Glu lie Ala Ala Asn lie Asn Lys Glu Lys Tyr Glu Pro Leu Tyr lie 
1460 1465 1470 

Gly lie Thr Lys Ser Gly Val Trp Lys Met Cys Glu Lys Pro Cys Ala 
1475 1480 1485 

Glu Trp Glu Asn Asp Asn Cys Tyr Ser Ala Val Leu Ser Pro Asp Lys 
1490 1495 1500 

Lys Met His Gly Leu Leu Val Lys Lys Asn His Glu Tyr Glu lie Asn 
1505 1510 1515 1520 

His Val Asp Val Ala Phe Ser Ala Leu His Gly Lys Ser Gly Glu Asp 

1525 1530 1535 

Gly Ser lie Gin Gly Leu Phe Glu Leu Ser Gly lie Pro Phe Val Gly 
1540 1545 1550 

Cys Asp lie Gin Ser Ser Ala lie Cys Met Asp Lys Ser Leu Thr Tyr 
1555 1560 1565 

lie Val Ala Lys Asn Ala Gly lie Ala Thr Pro Ala Phe Trp Val lie 
1570 1575 1580 

Asn Lys Asp Asp Arg Pro Val Ala Ala Thr Phe Thr Tyr Pro Val Phe 
1585 1590 1595 1600 

Val Lys Pro Ala Arg Ser Gly Ser Ser Phe Gly Val Lys Lys Val Asn 

1605 1610 1615 

Ser Ala Asp Glu Leu Asp Tyr Ala lie Glu Ser Ala Arg Gin Tyr Asp 
1620 1625 1630 

Ser Lys lie Leu lie Glu Gin Ala Val Ser Gly Cys Glu Val Gly Cys 
1635 1640 1645 

Ala Val Leu Gly Asn Ser Ala Ala Leu Val Val Gly Glu Val Asp Gin 
1650 1655 1660 

lie Arg Leu Gin Tyr Gly lie Phe Arg lie His Gin Glu Val Glu Pro 
1665 1670 1675 1680 



Glu Lys Gly Ser Glu Asn Ala Val lie Thr Val Pro Ala Asp Leu Ser 

1685 1690 1695 

Ala Glu Glu Arg Gly Arg lie Gin Glu Thr Ala Lys Lys lie Tyr Lys 
1700 1705 1710 

Ala Leu Gly Cys Arg Gly Leu Ala Arg Val Asp Met Phe Leu Gin Asp 
1715 1720 1725 

Asn Gly Arg lie Val Leu Asn Glu Val Asn Thr Leu Pro Gly Phe Thr 
1730 1735 1740 

Ser Tyr Ser Arg Tyr Pro Arg Met Met Ala Ala Ala Gly lie Ala Leu 
1745 1750 1755 1760 

Pro Glu Leu lie Asp Arg Leu lie Val Leu Ala Leu Lys Gly Ala Trp 

1765 1770 1775 

Lys Asp Leu Leu Phe Met Lys Tyr Thr Val Phe Val Gly Thr Leu Asn 
1780 1785 1790 

Met Pro Leu Gly lie lie Ser Pro Glu Asn Arg Leu Thr Val Met Lys 
1795 1800 1805 

lie Ala Leu Gly His Thr Ser Trp Leu Asn Arg Phe Arg Gin Lys Asn 
1810 1815 1820 

Trp Leu Leu Pro Lys Gly Thr Asp Cys Phe Tyr Gly Thr Val Thr Val 
1825 1830 1835 1840 

Leu Ser Val Leu Thr Val Leu Cys Asn Gly Leu His Ser Arg Lys lie 

1845 1850 1855 

Thr Gin Arg Lys Val lie lie Pro lie Leu Thr Glu Leu Arg Phe Gin 
1860 1865 1870 

Lys Asp Thr Trp Leu Gin Asn Gin Ala lie Ala Ala Ala Val Pro Leu 
1875 1880 1885 

lie Leu Arg Phe lie Asp Thr Arg Val Ser Leu Tyr Gin Trp Gly Ala 
1890 1895 1900 

Asp Leu lie Leu Trp Met Asn Ala Leu lie Met Arg Gin Met Glu Tyr 
1905 1910 1915 1920 

His Ala Met Lys Arg Lys lie Ala Asp Val Cys Ala Pro Ser Trp Lys 

1925 1930 1935 

Thr Val Gly Leu Lys His lie Ala Ser Asn Gly Gly Thr Met Tyr Glu 
1940 1945 1950 

Thr Asn His Thr Pro lie Ala lie Leu lie Ser Pro Leu Asn Lys Leu 
1955 1960 1965 

Leu Thr Val Ala Arg Thr Asn Tyr lie Ser Leu Phe Arg Gin Glu Thr 
1970 1975 1980 



• 



Arg Arg Met Leu Val Leu Arg Glu Phe lie Tyr Ser Arg Tyr Arg Cys 
1985 1990 1995 2000 

Lys Ala Glu Arg Tyr Cys Gly His Tyr Leu Arg Ala Leu Arg Gin Asp 

2005 2010 2015 

Ser Leu lie lie Arg Leu lie Ala Arg Gly Gly lie Ser His Arg Pro 
2020 2025 2030 

Leu Ser Thr Gly Ser Ser Ala Ser Leu Asn Ser Ala Trp Val Ser Leu 
2035 2040 2045 

Met Lys lie His Leu His Trp lie Gin Gly Glu lie lie Asp Cys Asn 
2050 2055 2060 

Leu Arg Gly Lys Thr Ala Gin Ser Gin Thr Arg Leu Cys Arg Leu Arg 
2065 2070 2075 2080 

Gly Arg Phe Lys Tyr Phe lie Leu Pro Thr lie Leu Arg Arg Arg Leu 

2085 2090 2095 

Lys Met Lys Lys Leu Phe Phe Leu Leu Leu Leu Leu Phe Leu lie Tyr 
2100 2105 2110 

Leu Gly Tyr Asp Tyr Val Asn Glu Ala Leu Phe Ser Gin Glu Lys Val 
2115 2120 2125 

Glu Phe Gin Asn Tyr Asp Gin Asn Pro Lys Glu His Leu Glu Asn Ser 
2130 2135 2140 

Gly Thr Ser Glu Asn Thr Gin Glu Lys Thr lie Thr Glu Glu Gin Val 
2145 2150 2155 2160 

Tyr Gin Gly Asn Leu Leu Leu lie Asn Ser Lys Tyr Pro Val Arg Gin 

2165 2170 2175 

Glu Val Ser Gin lie Ser lie Tyr Leu Asn Met Thr Asn Met Asp Thr 
2180 2185 2190 

Gly Cys Leu lie Val lie Phe lie Cys Gin Lys Lys His Lys Asn Phe 
2195 2200 2205 

Gin Arg Trp Ser Met Met Leu Arg Val Ala Leu Val lie Leu Leu Leu 
2210 2215 2220 

lie Val Ala lie Glu Thr Leu Met Ser Lys Val Cys Phe Thr Lys Lys 
2225 2230 2235 2240 

Trp Gly Leu Ser Met Pro Tyr Gin Gin Val lie Val Ser lie lie Gin 

2245 2250 2255 

Val Tyr His Met Asp Gin Ala Arg Lys Trp Asn Glu Pro Leu Lys Glu 
2260 2265 2270 

Ser Gly Lys Lys Met Leu Gly Asn Thr Gly Ser Phe Tyr Val lie Gin 
2275 2280 2285 




Arg Thr Lys Gin Ser Gin Glu Phe 
2290 2295 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 54 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ser Phe Ser Phe Cys Ser Phe Val Arg Asp Leu Leu Thr Val Leu Asn 
1 5 10 15 

Ser Phe Phe Ser Ala lie Ala Leu Ala Ser His Thr lie Leu Ser Ser 
20 25 30 

Val Val lie Ala Gly Ser lie lie Leu Phe Phe Leu Arg Lys Ser Met 
35 40 45 

His Ser Cys Ser Arg Met Ala Ser Pro Phe Ser Lys Ala Asn Arg Tyr 
50 55 60 

Leu Asn Val lie Arg Tyr Ser Leu Arg Val Lys Val Thr Lys Ser Tyr 
65 70 75 80 

Ser Leu Arg lie Ser Phe Lys Ser Gin Ser Val Phe Ser Leu Gly Ser 

85 90 95 

Ser Glu Asp Gly Leu Thr Pro lie Cys Phe Asp lie Tyr Cys Met Thr 
100 105 110 

Glu Ser Gly Met Leu Leu lie Val Tyr Gly Gin Pro Gly Tyr Arg Arg 
115 120 125 

Thr Ala Asn Thr Ala Asn Pro Lys Arg Phe Ser Ser Leu Leu Arg Leu 
130 135 140 

Leu Thr lie Ser Lys Ser Arg Leu Glu Lys Val Lys Val Pro Ser lie 
145 150 155 160 

His Ser Ser Ser Gly lie Cys lie Lys Ala Cys Leu Cys Ser Gly Val 

165 170 175 

Ser Asn Ser Leu Pro Leu Ala lie Phe lie Gin Tyr His Ser lie Ser 
180 185 190 

Val Phe Ser lie Tyr Phe Asn Tyr lie Ser lie Glu Cys Thr Leu Leu 
195 200 205 

lie Gin Met Thr Asp Lys lie lie Val Lys Ser Val Ser Asp Leu Ser 
210 215 220 



Gin Lys Gly Asp lie Leu Arg Lys lie Gly Tyr lie Arg Val Ser Ser 
225 230 235 240 

Thr Asn Gin Asn Pro Ser Arg Gin Phe Gin Gin Leu Asn Glu lie Gly 

245 250 255 

Met Asp lie lie Arg Glu Ser Phe Arg Ser Asn Lys Gly Ser Arg Ala 
260 265 270 

Thr Ser Lys Ser Val Arg Arg Phe Thr Gly Arg His His Leu Cys Tyr 
275 280 285 

Arg Leu Asn Ser Asn His Ser Tyr Thr Arg Ser lie lie Asn Arg His 
290 295 300 

Thr Arg Lys Gly Lys Phe Lys lie Thr Lys Arg Tyr Met Ala Phe lie 
305 310 315 320 

Arg Arg Ser lie Gin Pro lie Leu Asn Tyr Cys Asn Gly Trp Cys Pro 

325 330 335 

lie Arg Ala Arg Ser Tyr Ser Asp Glu Thr Thr Arg Asp lie Gly Glu 
340 345 350 

Arg Arg Lys Val Arg Ser lie Lys Glu Val Ser Lys Ser Arg Arg Asn 
355 360 365 

Glu Leu Cys Gly Glu Ser Tyr lie Lys Lys Glu lie Leu lie Lys Phe 
370 375 380 

Val Lys Leu Leu Met Tyr Leu Gly Leu His Tyr Thr Gly Asn Tyr Gin 
385 390 395 400 

Lys lie lie Ser His Ser Val Phe Arg Trp Ala lie Phe Leu Lys Lys 

405 410 415 

Lys Arg Lys Leu Asn lie Asn Ser Leu Leu Ala Met Pro Lys Ser Pro 
420 425 430 

Leu lie Lys Lys Glu Ser Ser Ser Glu lie Leu Ser His Leu Leu Cys 
435 440 445 

Lys Cys Leu lie Arg Pro Tyr Asn Leu lie Asn Tyr Gly Gin Thr Tyr 
450 455 460 

Val Lys Gly Leu Ala lie Lys Tyr Leu Leu Trp Met Met Asn Met Lys 
465 470 475 480 

Leu Pro lie Trp Leu Asn Tyr Thr Lys Thr Arg lie lie Arg Phe Ser 

485 490 495 

Asn Thr lie Pro Pro Lys Lys His Trp Asn Val Thr Ser Leu Arg Leu 
500 505 510 

Thr Leu Pro Tyr Trp Thr Ser Cys Phe Pro Ala Gin Ala Ala Leu Leu 
515 520 525 



Ser Val Lys Lys Gly Thr Ser Thr Pro lie Arg Leu Ser Cys Pro Gly 
530 535 540 

Lys lie Gin Arg lie Lys Leu Gin Gly Gin Ser Ala Arg Met lie lie 
545 550 555 560 

Arg Ser Pro Phe Ala His Trp Ser Leu Leu Gly Arg Pro Ser Cys Ala 

565 570 575 

Asp Thr Lys Asn Ser Val Glu Arg Ser Arg Thr Lys Met Leu Ser Ser 
580 585 590 

Thr Pro Ala Leu Ser Leu Met Leu Thr Pro Met Ser Val lie Thr Arg 
595 600 605 

Ser Ser Tyr Pro Leu Leu Pro Pro Ser Phe Gin Tyr Cys Glu Ser Ser 
610 615 620 

Val Lys Thr Arg Gly Met Trp Leu Ala Pro Ser Cys Tyr Phe Met Arg 
625 630 635 640 

Tyr Gly Ala Thr Asn lie Ser Ala Arg Ala Thr Thr Pro Ser Pro Cys 

645 650 655 

lie Ser Gly lie Cys Ala Lys Lys Thr Thr Pro Leu lie lie Arg Asn 
660 665 670 

lie Lys Arg Tyr Gly Gly Leu Val lie Lys Leu Lys Asn Lys Lys Asn 
675 680 685 

Asp Tyr Ser Lys Leu Glu Arg Lys Leu Tyr Met Tyr lie Val Ala lie 
690 695 700 

Val Val Val Ala lie Val Phe Val Leu Tyr lie Arg Ser Met lie Arg 
705 710 715 720 

Gly Lys Leu Gly Asp Trp lie Leu Ser lie Leu Glu Asn Lys Tyr Asp 

725 730 735 

Leu Asn His Leu Asp Ala Met Lys Leu Tyr Gin Tyr Ser lie Arg Asn 
740 745 750 

Asn lie Asp lie Phe lie Tyr Val Ala lie Val lie Ser lie Leu lie 
755 760 765 

Leu Cys Arg Val Met Leu Ser Lys Phe Ala Lys Tyr Phe Asp Glu lie 
770 775 780 

Asn Thr Gly lie Asp Val Leu lie Gin Asn Glu Asp Lys Gin lie Glu 
785 790 795 800 

Leu Ser Ala Glu Met Asp Val Met Glu Gin Lys Leu Asn Thr Leu Lys 

805 810 815 

Arg Thr Leu Glu Lys Arg Glu Gin Asp Ala Lys Leu Ala Glu Gin Arg 
820 825 830 



Lys Asn Asp Val Val Met Tyr Leu Ala His Asp lie Lys Thr Pro Leu 
835 840 845 

Thr Ser lie lie Gly Tyr Leu Ser Leu Leu Asp Glu Ala Pro Asp Met 
850 855 860 

Pro Val Asp Gin Lys Ala Lys Tyr Val His lie Thr Leu Asp Lys Ala 
865 870 875 880 

Tyr Arg Leu Glu Gin Leu lie Asp Glu Phe Phe Glu lie Thr Arg Tyr 

885 890 895 

Asn Leu Gin Thr lie Thr Leu Thr Lys Thr His lie Asp Leu Tyr Tyr 
900 905 910 

Met Leu Val Gin Met Thr Asp Glu Phe Tyr Pro Gin Leu Ser Ala His 
915 920 925 

Gly Lys Gin Ala Val lie His Ala Pro Glu Asp Leu Thr Val Ser Gly 
930 935 940 

Asp Pro Asp Lys Leu Ala Arg Val Phe Asn Asn lie Leu Lys Asn Ala 
945 950 955 960 

Ala Ala Tyr Ser Glu Asp Asn Ser lie lie Asp lie Thr Ala Gly Leu 

965 970 975 

Ser Gly Asp Val Val Ser lie Glu Phe Lys Asn Thr Gly Ser lie Pro 
980 985 990 

Lys Asp Lys Leu Ala Ala lie Phe Glu Lys Phe Tyr Arg Leu Asp Asn 
995 1000 1005 

Ser Arg Ser Ser Asp Thr Gly Gly Ala Gly Leu Gly Leu Ala lie Ala 
1010 1015 1020 

Lys Glu lie lie Val Gin His Gly Gly Gin lie Tyr Ala Glu Ser Tyr 
1025 1030 1035 1040 

Asp Asn Tyr Thr Thr Phe Arg Val Glu Leu Pro Ala Met Pro Asp Leu 

1045 1050 1055 

Val Asp Lys Arg Arg Ser Glu Met Tyr lie lie Phe Glu Asn Leu Lys 
1060 1065 1070 

Val lie Phe Thr Phe Ser Glu lie Asn Asn Leu lie Leu Arg Asn Gly 
1075 1080 1085 

Ser Phe Leu His Gly Arg Leu Asn Thr Val Arg Thr Ser Arg Phe Arg 
1090 1095 1100 

Ser Ser Glu Lys Asp Leu Thr Arg Leu Pro Leu Ala Ser Pro Phe Tyr 
1105 1110 1115 1120 

Leu Val Pro Phe Thr Glu Arg Val Gly Leu Asn Tyr Glu His Arg His 

1125 1130 1135 



Tyr Cys Leu Trp Met Ala Gly Gly Arg Cys lie Pro Cys Ser Phe Ala 
1140 1145 1150 

Ser Leu Trp Arg Tyr Gly Asn Asp Asn Arg Gin Arg Val Gly lie Gin 
1155 1160 1165 

Arg Gin He Arg Ala Phe Gin Ser Met Tyr Gin Cys Gly Thr He Arg 
1170 1175 1180 

Asp Phe Arg Leu Tyr Ser Ser Cys Ala Glu Glu Ser Arg Cys Glu lie 
1185 1190 1195 1200 

Tyr Phe Tyr Pro Lys His Arg Leu Gin Ser Tyr Arg Tyr Asn Cys Cys 

1205 1210 1215 

Glu Asn Gly His His Cys Arg Gin Cys Gly Val Leu Ala Gly Arg Cys 
1220 1225 1230 

Arg Leu Tyr Tyr Asp Ala Asn Ser Tyr Gly Ser Thr Gin Arg Lys He 
1235 1240 1245 

Asp Cys Ala Leu Cys Gly Lys Thr Phe Gin Val Gly Gin Arg Pro Trp 
1250 1255 1260 

Gin Gly Thr Gin Arg His Asp Ser Trp Cys Gly Gly Asn Gly Pro Asp 
1265 1270 1275 1280 

Arq Gin Ser Gly Tyr Ala Ala Ala Arg He Trp Met Ser Val Gly Leu 

1285 1290 1295 

Ser Gin Pro Lys Tyr Arg Gly Lys Leu Cys Thr Val Val Ala Ala Lys 
1300 1305 1310 

Arg Tyr Arg Tyr Ala Ser Cys Ala Ala Gin Tyr Gly Tyr Ala Leu Tyr 
1315 1320 1325 

Tyr Gin Pro Arg Thr Asn Thr Glu Asn Glu Ala Arg Ser He Ser Tyr 
1330 1335 1340 

Gin Tyr Trp Ala Arg Ser Thr Cys Arg Tyr Leu Val Gly Ser He Arg 
1345 1350 1355 1360 

Lys Arg Glu Thr Gly Arg Cys Arg He Gly Cys He Gly Arg Arg Gly 

1365 1370 1375 

Arg Val Phe Leu Leu Leu His Pro Lys Thr Asn Ser He Phe Thr Thr 
1380 1385 1390 

Ser Lys Asn Ala Arg Asp Asn His Thr Ala Tyr Gly Leu Leu Tyr Arg 
1395 1400 1405 

Ala Ser Val Ala Tyr Arg Lys Asn His Lys Leu Phe Gly Phe Lys Glu 
1410 1415 1420 

Thr Gly Ala He Glu Lys Leu Gin Tyr Cys Leu Gly Val Ala Gin Arg 
1425 1430 1435 1440 



Ser Met Thr Tyr Arg Asn Leu Gin Arg Pro Leu Thr Leu lie Lys Lys 

1445 1450 1455 

Asn Thr Ser Arg Tyr Thr Leu Glu Leu Arg Asn Leu Val Tyr Gly Lys 
1460 1465 1470 

Cys Ala Lys Asn Leu Ala Arg Asn Gly Lys Thr Thr lie Ala lie Gin 
1475 1480 1485 

Leu Tyr Ser Arg Arg lie Lys Lys Cys Thr Asp Tyr Leu Leu Lys Arg 
1490 1495 1500 

Thr Met Asn Met Lys Ser Thr Met Leu Met His Phe Gin Leu Cys Met 
1505 1510 1515 1520 

Ala Ser Gin Val Lys Met Asp Pro Tyr Lys Val Cys Leu Asn Cys Pro 

1525 1530 1535 

Val Ser Leu Leu Ala Ala lie Phe Lys Ala Gin Gin Phe Val Trp Thr 
1540 1545 1550 

Asn Arg His Thr Ser Leu Arg Lys Met Leu Gly Leu Leu Pro Pro Phe 
1555 1560 1565 

Gly Leu Leu lie Lys Met lie Gly Arg Trp Gin Leu Arg Leu Pro lie 
1570 1575 1580 

Leu Phe Leu Leu Ser Arg Arg Val Gin Ala His Pro Ser Val Lys Lys 
1585 1590 1595 1600 

Ser lie Ala Arg Thr Asn Trp Thr Thr Gin Leu Asn Arg Gin Asp Asn 

1605 1610 1615 

Met Thr Ala Lys Ser Leu Ser Arg Leu Phe Arg Ala Val Arg Ser Val 
1620 1625 1630 

Val Arg Tyr Trp Glu Thr Val Pro Arg Leu Leu Ala Arg Trp Thr Lys 
1635 1640 1645 

Ser Gly Cys Ser Thr Glu Ser Phe Val Phe lie Arg Lys Ser Ser Arg 
1650 1655 1660 

Lys Lys Ala Leu Lys Thr Gin Leu Pro Phe Pro Gin Thr Phe Gin Gin 
1665 1670 1675 1680 

Arg Ser Glu Asp Gly Tyr Arg Lys Arg Gin Lys Lys Tyr lie Lys Arg 

1685 1690 1695 

Ser Ala Val Glu Val Pro Val Trp lie Cys Phe Tyr Lys lie Thr Ala 
1700 1705 1710 

Ala Leu Tyr Thr Lys Ser lie Leu Cys Pro Val Ser Arg His Thr Val 
1715 1720 1725 

Val lie Pro Val Trp Pro Leu Gin Val Leu His Phe Pro Asn Leu Thr 
1730 1735 1740 



Ala Ser Tyr Arg Arg Gly Asp Lys His Gly Asn Arg lie Tyr Phe Phe 
1745 1750 1755 1760 

Arg Asn Ser Thr Arg Cys Ser Leu Gly Arg lie Cys His Leu Gly Phe 

1765 1770 1775 

His Arg Lys Thr Gly Arg Leu Ser Lys Ser His Cys Arg Asp lie Arg 
1780 1785 1790 

Val Gly lie Ala Phe Glu Gly Lys Arg Thr Gly Cys Tyr Pro Arg Val 
1795 1800 1805 

Arg lie Ala Ser Met Gly Arg Leu Pro Ser Ala Cys Cys Lys Leu Phe 
1810 1815 1820 

Tyr Ala Met Gly Cys Thr Ala Gly Lys Pro Asp Lys Gly Lys Leu Leu 
1825 1830 1835 1840 

Ser Gin Tyr Pro Asn Asp Asp Phe Lys Arg lie Arg Gly Phe Lys lie 

1845 1850 1855 

Lys Pro Pro Arg Gin Cys His Ser Tyr Ala Leu Ser lie Arg His Gly 
1860 1865 1870 

Ala Cys Thr Asn Gly Glu Pro lie Phe Tyr Gly Thr Leu Ser Ser Cys 
1875 1880 1885 

Gly Lys Trp Asn lie Met Gin Ser Ala Lys Ser Gin Thr Phe Ala Leu 
1890 1895 1900 

His His Gly Lys Gin Trp Val Ser lie Pro Arg Met Val Ala Leu Cys 
19 05 1910 1915 192 0 

lie Lys Arg Arg Thr lie Pro Gin Leu Phe Phe Pro Arg lie Asn Phe 

1925 1930 1935 

Pro Leu His Gly Gin Thr lie Ala Asn Ser Phe Gly Arg Lys Pro Asp 
1940 1945 1950 

Val Cys Asn Trp Phe Leu Gly Asn Leu Tyr lie Val Asp Ser lie Glu 
1955 1960 1965 

Asp Val Arg Gin Ser Asp lie Ala Val lie lie Cys Val Arg Cys Gly 
1970 1975 1980 

Lys lie Ala Asp Ser His Arg Gly Val Val Phe His Thr Ala His Cys 
1985 1990 1995 2000 

Gin Gin Ala Val Gin Pro Arg lie Gin His Gly Tyr His Leu Lys Phe 

2005 2010 2015 

lie Tyr lie Gly Asp Asn Ser Lys Ser Ser Arg Ala Lys Leu Thr Val 
2020 2025 2030 

lie Tyr Gly Ala Lys Arg His Asn Leu Lys Arg Asp Cys Ala Val Gly 
2035 2040 2045 



Glu Asp Ser Arg Asn lie Ser Tyr Phe Gin Leu Tyr Ser Gly Gly Asp 
2050 2055 2060 

Lys Arg Ser Cys Phe Phe Tyr Cys Tyr Cys Tyr Ser Tyr Thr Val Met 
2065 2070 2075 2080 

Thr Thr Leu Met Lys His Cys Phe Leu Arg Lys Lys Ser Asn Phe Lys 

2085 2090 2095 

lie Met lie Lys lie Pro Lys Asn lie Lys lie Val Gly Leu Leu Lys 
2100 2105 2110 

lie Pro Lys Arg Lys Gin Leu Gin Lys Asn Arg Phe lie Lys Glu lie 
2115 2120 2125 

Cys Tyr Ser lie Val Asn lie Leu Phe Ala Lys Lys Cys Glu Val Arg 
2130 2135 2140 

Tyr Arg Glu Phe lie Thr Arg lie Asn Lys Trp lie Arg Val Ala Tyr 
2145 2150 2155 2160 

Leu Tyr Val Lys Arg Asn Ser Thr Lys lie Phe Arg Asp Gly Gin Cys 

2165 2170 2175 

Cys Lys Gly Trp Arg Ser Phe Tyr Tyr Trp Leu Ser Arg Leu Ala Lys 
2180 2185 2190 

Cys Ala Leu Pro Arg Asn Gly Gly Val Cys Leu Thr Ser Arg Leu Ala 
2195 2200 2205 

Phe Arg Phe lie Thr Arg Cys Arg lie Lys Leu Asp Glu Asn Gly Thr 
2210 2215 2220 

Ser Pro Arg Lys Val Asp Arg Arg Lys Cys Leu Glu lie Arg Val His 
2225 2230 2235 2240 

Phe Thr Leu Ser Arg Gly Gin Asn Arg Val Asn Arg Asn Ser 

2245 2250 

INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2291 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Ala Phe Leu Phe Ala His Leu Leu Glu lie Tyr Pro Tyr lie Ala Ser 
15 10 15 

Phe Gin Pro Leu Pro Leu Leu Pro Thr Pro Phe Phe Gin Val Gin Ala 
20 25 30 



* 



Val Phe Cys Phe Phe Leu Glu Asn Leu Cys lie His Ala Val Asp Glu 
35 40 45 

Trp His His His Phe Pro Lys Leu lie Asp Glu Gly Thr Met Ser Phe 
50 55 60 

Asp lie His Ser Gly Lys Leu Gin Ser Arg lie His Phe Glu Phe Leu 
65 70 75 80 

Ser Asn Asp Pro Lys Val Tyr Phe Pro Phe Glu Asp Asn Asp Gin Ala 

85 90 95 

Arg Met Asp His Gin Ser Val Ser lie Tyr lie Val Pro Asn Leu Gly 
100 105 110 

Cys Phe Tyr Glu Cys Met Ala Asn Arg Asp Thr Glu Glu Gin Leu lie 
115 120 125 

Glu Gin Gin lie Leu Asn Gly Phe Leu Pro Ser Phe Ala Tyr Leu Phe 
130 135 140 

Leu Asn Pro Val Trp Lys Lys Ser Arg Ser Pro Val Ser lie His Leu 
145 150 155 160 

Gin Gly Phe Ala Lys Pro Val Ser Val Pro Val Ala lie Leu Tyr Leu 

165 170 175 

Ser Gin Phe Ser Phe Ser lie lie Pro Phe Leu Tyr Phe Gin Phe lie 
180 185 190 

Ser Ser lie lie Tyr Gin Ser Val Leu Tyr Tyr Lys Cys Ser Arg Leu 
195 200 205 

lie Lys Ser Leu Arg Ala Ser His Lys Thr Cys Leu Lys Asn Glu Val 
210 215 220 

lie Phe Cys Gly Lys Ser Val lie Phe Val Ser Val Arg Leu Thr Arg 
225 230 235 240 

lie Leu Gin Asp Asn Phe Ser Ser Thr Arg Ser Glu Trp lie Leu Tyr 

245 250 255 

Lys Glu Lys Val Ser Gly Ala Thr Lys Asp Arg Glu Gin Leu Gin Lys 
260 265 270 

Val Leu Asp Asp Leu Gin Glu Asp Asp lie lie Tyr Val Thr Asp Leu 
275 280 285 

Thr Arg lie Thr Arg Ser Thr Gin Asp Leu Phe Glu Leu lie Asp Asn 
290 295 300 

lie Arg Asp Lys Lys Ala Ser Leu Lys Ser Leu Lys Asp Thr Trp Leu 
305 310 315 320 

Asp Leu Ser Glu Asp Asn Pro Tyr Ser Gin Phe Leu lie Thr Val Met 

325 330 335 



Ala Gly Val Asn Gin Leu Glu Arg Asp Leu lie Arg Met Arg Gin Arg 
340 345 350 

Glu Gly lie Glu Leu Ala Lys Lys Glu Gly Lys Phe Lys Gly Arg Leu 
355 360 365 

Lys Lys Tyr His Lys Asn His Ala Gly Met Asn Tyr Ala Ala Lys Ala 
370 375 380 

lie Arg Arg Lys Tyr Asp Cys Lys Ser Asn Leu Asn Tyr Cys lie Gly 
385 390 395 400 

Phe lie lie Gin Glu lie lie Arg Ser Glu Leu Ala lie Leu Tyr Ser 

405 410 415 

Ala Asn Gly Gin Tyr Phe Arg Arg Lys Gly Asn Tyr Lys lie Leu Thr 
420 425 430 

Ala Ser Arg Cys Arg Lys Ala Leu Lys Lys Asn His His Leu Lys Lys 
435 440 445 

Phe Leu Val lie Tyr Tyr Val Asn Ala Tyr Lys Phe Gly Pro lie lie 
450 455 460 

lie lie Lys Gly Lys Leu Met Lys Gly Asp Asn Tyr Glu Arg Asn Thr 
465 470 475 480 

Tyr Cys Gly Thr Asn Cys Arg Phe Gly lie lie Leu Lys Lys Arg Glu 

485 490 495 

Leu Tyr Gly Phe Gin lie Leu Tyr Arg Gin Arg Ser lie Gly Met Tyr 
500 505 510 

Arg Gin Val Asp Pro Cys His lie Gly His His Ala Ser Arg His Lys 
515 520 525 

Arg Pro Tyr Tyr Leu Ser Lys Asn Lys Gly Gin Ala His Leu Ser Asp 
530 535 540 

Tyr His Ala Asp Arg Glu Arg Tyr Arg Gly Arg Asn Tyr Arg Val Asn 
545 550 555 560 

Asn Arg Arg Gly Leu Tyr Asn Glu Ala Leu Ser Pro Thr Gly Val Asn 

565 570 575 

Cys Ser Gly Lys Gly Pro Val Ala Pro lie Gin Lys lie Gin Trp Ser 
580 585 590 

Lys Gly Ala Glu Arg Lys Cys Tyr Arg Pro Leu Arg Pro Cys His Cys 
595 600 605 

His Pro Val Leu Ser Glu Arg Glu Ala Val lie Pro Tyr Ser His Arg 
610 615 620 

Val Phe Asn Thr Ala Asn Pro Leu Lys Gin Gly Glu Cys Gly Leu Arg 
625 630 635 640 



Ala Ala lie Ser Asp Met Gly Arg Arg lie Phe Gin Gin Glu Gin Gin 

645 650 655 

His His His Arg Ala Tyr Pro Ala Phe Ala Arg Lys Asn Glu Arg His 
660 665 670 

His Ser Glu lie Tyr Lys Asn Gly Met Gly Gly Trp Leu Asn Lys lie 
675 680 685 

Lys Lys Thr Thr lie Pro Asn Asn Glu Asn Phe Thr Cys lie Ser Leu 
690 695 700 

Gin Leu Leu Trp Gin Leu Tyr Ser Cys Cys lie Phe Val Gin Ser Glu 
705 710 715 720 

Gly Asn Leu Gly lie Gly Ser Val Phe Trp Lys Thr Asn Met Thr lie 

725 730 735 

Thr Trp Thr Arg Asn Tyr lie Asn lie Pro Tyr Gly Thr lie lie Ser 
740 745 750 

Leu Phe Met Trp Arg Leu Ser Leu Val Phe Leu Phe Tyr Val Ala Ser 
755 760 765 

Cys Phe Gin Asn Ser Gin Asn Thr Leu Thr Arg lie Pro Ala Leu Met 
770 775 780 

Tyr Leu Phe Arg Thr Lys lie Asn Lys Leu Ser Phe Leu Arg Lys Trp 
785 790 795 800 

Met Leu Trp Asn Lys Ser Ser Thr His Asn Gly Leu Trp Lys Ser Glu 

805 810 815 

Ser Arg Met Gin Ser Trp Pro Asn Lys Glu Lys Met Thr Leu Leu Cys 
820 825 830 

Thr Trp Arg Thr lie Leu Lys Arg Pro Leu His Pro Leu Ser Val lie 
835 840 845 

Ala Cys Leu Thr Arg Leu Gin Thr Cys Arg lie Lys Arg Gin Ser Met 
850 855 860 

Cys lie Ser Arg Trp Thr Lys Arg lie Asp Ser Asn Ser Ser Thr Ser 
865 870 875 880 

Phe Leu Arg Leu His Gly lie Thr Tyr Lys Arg Arg Gin Lys Arg Thr 

885 890 895 

Thr Tyr Thr lie Cys Trp Cys Arg Pro Met Asn Phe lie Leu Ser Phe 
900 905 910 

Pro His Met Glu Asn Arg Arg Leu Phe Thr Pro Pro Arg lie Pro Cys 
915 920 925 

Pro Ala Thr Leu lie Asn Ser Arg Glu Ser Leu Thr Thr Phe Lys Thr 
930 935 940 



Pro Leu His Thr Val Arg lie Thr Ala Ser Leu Thr Leu Pro Arg Ala 
945 950 955 960 

Ser Pro Gly Met Trp Cys Gin Ser Asn Ser Arg Thr Leu Glu Ala Ser 

965 970 975 

Gin Lys lie Ser Leu Pro Tyr Leu Lys Ser Ser lie Gly Trp Thr lie 
980 985 990 

Leu Val Leu Pro lie Arg Val Ala Arg Asp Leu Asp Trp Arg Leu Gin 
995 1000 1005 

Lys Lys Leu Leu Phe Ser Met Glu Gly Arg Phe Thr Arg Lys Ala Met 
1010 1015 1020 

lie Thr lie Arg Arg Leu Gly Ser Phe -Gin Arg Cys Gin Thr Trp Leu 
1025 1030 1035 1040 

lie Lys Gly Gly Pro Lys Arg Cys lie Phe Phe Arg Lys lie Ser Arg 

1045 1050 1055 

Leu Ser Leu Leu Phe Leu Arg Lys Leu Thr lie Tyr Glu Thr Ala Arg 
1060 1065 1070 

Ser Tyr Thr Val Asp Leu lie Pro Glu Arg Ala Val Phe Val Leu Gin 
1075 1080 1085 

Arg Lys lie Gin Asp Tyr His Trp His Pro Arg Phe lie Trp Cys Leu 
1090 1095 1100 

Ser Gin Lys Gly Leu Val Leu lie Met Asn Asn lie Gly lie Thr Val 
1105 1110 1115 1120 

Tyr Gly Cys Glu Gin Asp Glu Ala Asp Ala Phe His Ala Leu Ser Pro 

1125 1130 1135 

Arg Phe Gly Val Met Ala Thr lie lie Asn Ala Asn Val Ser Glu Ser 
1140 1145 1150 

Asn Ala Lys Ser Ala Pro Phe Asn Gin Cys lie Ser Val Gly His Lys 
1155 1160 1165 

Ser Glu lie Ser Ala Ser lie Leu Leu Ala Leu Lys Arg Ala Gly Val 
1170 1175 1180 

Lys Tyr lie Ser Thr Arg Ser lie Gly Cys Asn His lie Asp Thr Thr 
1185 1190 1195 1200 

Ala Ala Lys Arg Met Gly lie Thr Val Asp Asn Val Ala Tyr Ser Pro 

1205 1210 1215 

Asp Ser Val Ala Asp Tyr Thr Met Met Leu lie Leu Met Ala Val Arg 
1220 1225 1230 

Asn Val Lys Ser lie Val Arg Ser Val Glu Lys His Asp Phe Arg Leu 
1235 1240 1245 



Asp Ser Asp Arg Gly Lys Val Leu Ser Asp Met Thr Val Gly Val Val 
1250 1255 1260 

Gly Thr Gly Gin lie Gly Lys Ala Val lie Glu Arg Leu Arg Gly Phe 
1265 1270 1275 1280 

Gly Cys Lys Val Leu Ala Tyr Ser Arg Ser Arg Ser lie Glu Val Asn 

1285 1290 1295 

Tyr Val Pro Phe Asp Glu Leu Leu Gin Asn Ser Asp lie Val Thr Leu 
1300 1305 1310 

His Val Pro Leu Asn Thr Asp Thr His Tyr lie lie Ser His Glu Gin 
1315 1320 1325 

lie Gin Arg Met Lys Gin Gly Ala Phe Leu lie Asn Thr Gly Arg Gly 
1330 1335 1340 

Pro Leu Val Asp Thr Tyr Glu Leu Val Lys Ala Leu Glu Asn Gly Lys 
1345 1350 1355 1360 

Leu Gly Gly Ala Ala Leu Asp Val Leu Glu Gly Glu Glu Glu Phe Phe 

1365 1370 1375 

Tyr Ser Asp Cys Thr Gin Lys Pro lie Asp Asn Gin Phe Leu Leu Lys 
1380 1385 1390 

Leu Gin Arg Met Pro Asn Val lie lie Thr Pro His Thr Ala Tyr Tyr 
1395 1400 1405 

Thr Glu Gin Ala Leu Arg Asp Thr Val Glu Lys Thr lie Lys Asn Cys 
1410 1415 1420 

Leu Asp Phe Glu Arg Arg Gin Glu His Glu Asn Lys Ser Cys Asn Thr 
1425 1430 1435 1440 

Val Trp Gly Leu Leu Arg Gly Ala Arg lie Gly Lys lie Cys Asn Arg 

1445 1450 1455 

Asp Ser Arg His Arg Lys lie Arg Ala Val lie His Trp Asn Tyr Glu 
1460 1465 1470 

lie Trp Cys Met Glu Asn Val Arg Lys Thr Leu Arg Gly Met Gly Lys 
1475 1480 1485 

Arg Gin Leu Leu Phe Ser Cys Thr Leu Ala Gly Lys Asn Ala Arg lie 
1490 1495 1500 

Thr Cys Lys Glu Pro lie Asn Gin Pro Cys Cys Ser lie Phe Ser Phe 
1505 1510 1515 1520 

Ala Trp Gin Val Arg Arg Trp lie His Thr Arg Ser Val lie Val Arg 

1525 1530 1535 

Tyr Pro Phe Cys Arg Leu Arg Tyr Ser Lys Leu Ser Asn Leu Tyr Gly 
1540 1545 1550 



Gin lie Val Asp lie His Arg Cys Glu Lys Cys Trp Asp Ser Tyr Ser 
1555 1560 1565 

Arg Leu Leu Gly Tyr Arg Ala Gly Gly Ser Tyr Val Tyr Leu Ser Cys 
1570 1575 1580 

Phe Cys Ala Gly Ala Phe Arg Leu lie Leu Arg Cys Glu Lys Ser Gin 
1585 1590 1595 1600 

Arg Gly Arg lie Gly Leu Arg Asn lie Gly Lys Thr lie Gin Gin Asn 

1605 1610 1615 

Leu Asn Ala Gly Cys Phe Gly Leu Gly Arg Leu Cys Gly lie Gly Lys 
1620 1625 1630 

Gin Cys Arg Val Ser Cys Trp Arg Gly Gly Pro Asn Gin Ala Ala Val 
1635 1640 1645 

Arg Asn Leu Ser Tyr Ser Ser Gly Ser Arg Ala Gly Lys Arg Leu Lys 
1650 1655 1660 

Arg Ser Tyr Asn Arg Ser Arg Arg Pro Phe Ser Arg Gly Ala Arg Thr 
1665 1670 1675 1680 

Asp Thr Gly Asn Gly Lys Lys Asn lie Ser Ala Arg- Leu Arg Ser Ser 

1685 1690 1695 

Pro Cys Gly Tyr Val Phe Thr Arg Arg Pro His Cys Thr Glu Arg Ser 
1700 1705 1710 

Gin Tyr Ser Ala Arg Phe His Val lie Gin Ser Leu Ser Pro Tyr Asp 
1715 1720 1725 

Gly Arg Cys Arg Tyr Cys Thr Ser Arg Thr Asp Pro Leu Asp Arg lie 
1730 1735 1740 

Ser Val Lys Gly Val lie Ser Met Glu lie Gly Phe Thr Phe Leu Asp 
1745 1750 1755 1760 

Glu lie Val His Gly Val Arg Trp Asp Ala Lys Tyr Ala Thr Trp Asp 

1765 1770 1775 

Asn Phe Thr Gly Lys Pro Val Asp Gly Tyr Glu Val Asn Arg lie Val 
1780 1785 1790 

Gly Thr Tyr Glu Leu Ala Glu Ser Leu Leu Lys Ala Lys Glu Leu Ala 
1795 1800 1805 

Ala Thr Gin Gly Tyr Gly Leu Leu Leu Trp Asp Gly Tyr Arg Pro Lys 
1810 1815 1820 

Arg Ala Val Asn Cys Phe Met Gin Trp Ala Ala Gin Pro Glu Asn Asn 
1825 1830 1835 1840 

Leu Thr Lys Glu Ser Tyr Tyr Pro Asn lie Asp Arg Thr Glu Met lie 

1845 1850 1855 



Ser Lys Gly Tyr Val Ala Ser Lys Ser Ser His Ser Arg Gly Ser Ala 
1860 1865 1870 

lie Asp Leu Thr Leu Tyr Arg Leu Asp Thr Gly Glu Leu Val Pro Met 
1875 1880 1885 

Gly Ser Arg Phe Asp Phe Met Asp Glu Arg Ser His His Ala Ala Asn 
1890 1895 1900 

Gly lie Ser Cys Asn Glu Ala Gin Asn Arg Arg Arg Leu Arg Ser lie 
1905 1910 1915 1920 

Met Glu Asn Ser Gly Phe Glu Ala Tyr Ser Leu Glu Trp Trp His Tyr 

1925 1930 1935 

Val Leu Arg Asp Glu Pro Tyr Pro Asn Ser Tyr Phe Asp Phe Pro Val 
1940 1945 1950 

Lys Thr Phe Asn Arg Cys Thr Asp Lys Leu Tyr Lys Leu Thr Leu Ser 
1955 1960 1965 

Ala Gly Asn Pro Thr Tyr Val Thr Gly Ser Gly lie Tyr lie lie Val 
1970 1975 1980 

Leu Lys Met Gly Arg Ala lie Leu Arg Ser Leu Ser Ala Cys Ala Ala 
1985 1990 1995 2000 

Ala Arg Pro Asp Asn Lys Thr Asp Arg lie Glu Gly Trp Tyr Phe Thr 

2005 2010 2015 

Pro Pro lie Val Asn Arg Gin Phe Ser Leu Val Lys Phe Ser Met Gly 
2020 2025 2030 

lie Thr Tyr Glu Asn Ser Ser Thr Leu Val lie lie Val Asn Pro Val 
2035 2040 2045 

Gly Arg Asn Asn Leu Phe Thr Gly Gin Asn Gly Thr lie Ser Asn Glu 
2050 2055 2060 

lie Val Pro Phe Lys Gly Lys lie Leu Glu lie Phe His Thr Ser Asn 
2065 2070 2075 2080 

Tyr lie Val Lys Glu Glu Thr Glu Asn Glu Glu Val Val Phe Phe lie 

2085 2090 2095 

Val lie Val lie Leu Asn lie Leu Arg Leu Leu Arg Ser Thr Val Phe 
2100 2105 2110 

Ser Gly Lys Ser Arg lie Ser Lys Leu Ser Lys Ser Gin Arg Thr Phe 
2115 2120 2125 

Arg Lys Trp Asp Phe Lys Tyr Pro Arg Glu Asn Asn Tyr Arg Arg Thr 
2130 2135 2140 

Gly Leu Ser Arg Lys Ser Ala lie Asn Gin lie Ser Cys Ser Pro Arg 
2145 2150 2155 2160 



Ser Val Lys Ser Asp lie Val Asn Leu Ser Lys His Asp Glu Leu lie 

2165 2170 2175 

Asn Gly Tyr Gly Leu Leu Asp Ser Asn lie Tyr Met Ser Lys Glu lie 
2180 2185 2190 

Ala Gin Lys Phe Ser Glu Met Val Asn Asp Ala Val Lys Gly Gly Val 
2195 2200 2205 

Ser His Phe lie lie Asn Ser Gly Tyr Arg Asp Phe Asp Glu Gin Ser 
2210 2215 2220 

Val Leu Tyr Gin Glu Met Gly Ala Glu Tyr Ala Leu Pro Ala Gly Tyr 
2225 2230 2235 2240 

Ser Glu His Asn Ser Gly Leu Ser Leu Asp Val Gly Ser Ser Leu Thr 

2245 2250 2255 

Lys Met Glu Arg Ala Pro Glu Gly Lys Trp lie Glu Glu Asn Ala Trp 
2260 2265 2270 

Lys Tyr Gly Phe lie Leu Arg Tyr Pro Glu Asp Lys Thr Glu Leu Thr 
2275 2280 2285 

Gly lie Gin 
2290 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CAAAATATCA CCTCATTTTT GAGACAAGTC TTATGAGACG CTCTTAACTA TGATTTTATC 60 

AGTCTACTAC ATTTGTATCA ATAGAGTACA CTCTATTGAT ATATAATTGA ACTAATAAAT 12 0 

TGAAAATACA GAAATGGAAT GATACTGAAA TGAAAATTGC GAGAGGTAGA GAATTGCTTA 18 0 

CACCGGAACA GAGACAGGCT TTTATGCAAA TCCCTGAAGA TGAATGGATA CTGGGGACCT 240 

ACTTCACTTT TTCCAAACGG GATTTAGAAA TAGTTAATAA GCGAAGGAGG GAAGAAAACC 3 00 

GTTTAGGATT TGCTGTTCAA TTAGCTGTTC TTCGGTATCC CGGTTGGCCA TACACTCATA 3 60 

TCAAAAGCAT CCCAGATTCG GTCATACAAT AT AT AT CG AA ACAGATTGGT GTTAGTCCAT 42 0 

CCTCGCTTGA TCATTATCCT CAAAGGGAAA ATACACTTTG GGATCATTTG AAAGAAATTC 480 

GAAGTGAATA CGACTTTGTA ACTTTTACCC TGAGTGAATA TCGAATGACA TTTAAGTACC 54 0 



TTCATCAATT AGCTTTGGAA AATGGTGATG CCATTCATCT ACTGCATGAA TG CAT AG ATT 600 

TTCTAAGAAA AAACAAAATT ATACTGCCTG CTATCACTAC ACTTGAAAGA ATGGTGTGGG 660 

AAGCAAGGGC AATGGCTGAA AAGAAGCTAT TTAATACGGT TAGTAAATCT CTAACAAATG 720 

AGCAAAAAGA AAAGCTTGAA GGGATTATTA CCTCGCAGCA TCCATCCGAA TCCAATAAAA 780 

CGATATTGGG TTGGTTAAAA GAGCCACCGG GTCATCCTTC ACCCGAAACT TTTCTAAAAA 8 40 

TAATAGAACG ACTCGAATAC ATACGAGGAA TGGATTTAGA AACAGTGCAA ATTAGTCATT 9 00 

TGCACCGTAA CCGCCTGTTG CAGCTGTCTC GCTTAGGCTC AAGATACGAG CCGTATGCAT 9 60 

TCCGTGACTT TCAAGAAAAT AAACGTTATT CGATATTAAC CATCTATTTA TTACAACTTA 102 0 

CTCAGGAGCT AACGGATAAA GCGTTTGAAA TTCATGATAG GCAAATACTT AGTTTGTTAT 108 0 

CAAAAGGTCG TAAGGCTCAA GAGGAAATCC AGAAACAAAA CGGTAAAAAG CTAAATGAGA 114 0 

AAGTTATACA CTTTACGAAC ATCGGACAAG CATTAATTAA AG C AAG AG AG GAAAAATTAG 12 00 

ACGTTTTTAA GGTTTTAGAA TCGGTTATTG AATGGAATAC CTTTGTCTCT TCAGTAGAAG 12 60 

AGGCTCAGGA ACTTGCACGT CCTGCCGACT ATGATTATTT AGACTTACTG CAAAAACGGT 13 2 0 

TTTATTCACT AAGAAAATAT ACGCCAACGC TATTAAGAGT ATTGGAATTT CATTCTACAA 13 80 

AGGCAAATGA GCCACTTTTA CAAGCTGTTG AG ATT AT C CG AGGAATGAAC GAATCTGGAA 14 4 0 

AGCGAAAAGT GCCTGATGAC TCACCTGTGG ATTTTATTTC AAAACGATGG AAAAGACATT 15 0 0 

TATA C G AGG A TGATGGTACA ACAATTAATC GT C ATT ACT A TGAAATGGCT GTTTTAACAG 1560 

AACTTCGGGA GCATGTTCGG GCAGGAGATG TTTCCATTGT TGGCAGCAGA CAATATAGGG 162 0 

ATTTTGAGGA ATATTTGTTT TCGGAAGATA CATGGAATCA ATCGAAGGGG AATACGAGAT 168 0 

TATCAGTTAG TTTATCATTC GAAGATTATA TAACGGAGAG AACCAGCAGC TTTAATGAAA 174 0 

GGTTAAAGTG GTTAGCTGCC AATTCCAATA AGTTAGATGG GGTTTCTCTT GAAAAAGGAA 18 0 0 

AGCTATCACT TGCACGCTTA GAAAAAGATG TTCCAGAAGA AGCAAAAAAA TTTAGTGCAA 18 6 0 

GCCTTTATCA GATGCTACCA AGAATAAAAT TAACTGATTT ACTCATGGAT GTGGCCCATA 192 0 

TAACAGGATT TCATGAGCAA TTCACTCATG CTTCCAATAA TCGAAAACCA GATAAGGAAG 198 0 

AAACAATCAT TATCATGGCT GCCCTTTTAG GAATGGGAAT GAATATTGGC TTG AG C AAG A 2 04 0 

TGGCCGAAGC CACACCCGGA CTTACATATA AGCAACTAGC CAATGTATCT CAATGGCGCA 2100 

TGTATGAAGA TGCCATGAAT AAAGCCCAAG CCATATTAGT AAACTTTCAT CATAAATTAC 2160 

AATTGCCTTT CTATTGGGGC GACGGTACAA CATCTTCGTC AGATGGTATG AGAATGCAGC 2 220 

TAGGTGTTTC ATCACTACAT GCAGATGCAA ATCCACATTA TGGAACTGGA AAAGGAGCCA 22 80 



CCATCTACCG 


ATTTACAAGT 


GATCAATTCT 


CTTCTTACTA 


CACAAAGATT 


ATTCATACTA 


2340 


ATTCAAGAGA 


TGCGATTCAT 


GTTTTGGATG 


GTTTGTTACA 


TCATGAGACG 


GATCTAAACA 


2400 


TAGAGGAACA 


TTATACAGAC 


ACTGCCGGTT 


ACACTGACCA 


AATATTCGGA 


CTGACTCATT 


2460 


TATTAGGATT 


TAAATTTGCC 


CCAAGAATAA 


GGGATTTATC 


GGACTCAAAA 


TT ATT T AC G A 


2520 


TAGATAAAGC 


AAGTGAGTAT 


CCAAAACTAG 


AAGCCATTTT 


ACGTGGACAA 


ATAAATACAA 


2580 


AGGTCATTAA 


AGAAAATTAT 


GAGGATGTTT 


TGCGATTAGC 


TCATTCTATA 


AGGGAGGGAA 


2640 


CAGTTTCAGC 


ATCCCTTATT 


ATGGGGAAGC 


TAGGTTCCTA 


TTCAAGACAA 


AACAGCTTAG 


2700 


CTACAGCCTT 


ACGTGAGATG 


GGCCGAATAG 


AAAAAACGAT 


CTTTATTTTG 


AATTATATAT 


2760 


CGGATGAATC 


ATTAAGAAGA 


AAAATACAAA 


GAGGATTGAA 


TAAAGGAGAA 


GCCATGAATG 


2820 


GATTGGCAAG 


AGCTATTTTC 


TTCGGAAAAC 


AAGGTGAGCT 


TAGAGAACGC 


ACCATACAGC 


2880 


ATCAATTGCA 


AAGAGCCAGT 


GCTTTAAACA 


TAATTATCAA 


TGCTATAAGT 


ATTTGGAATA 


2940 


CTCTCCACCT 


AACAACAGCA 


GTTGAATATA 


AAAAACGGAC 


AGGTAGCTTT 


AATGAAGATT 


3000 


TGTTACACCA 


TATGTCGCCC 


TTAGGTTGGG 


AACATATTAA 


TTTACTAGGA 


GAATACCATT 


3060 


TTAACTCAGA 


GAAAGTAGTC 


TCATTAAATT 


CTTTAAGACC 


ACTAAAACTT 


TCTTAACGTT 


3120 


GTTAAAAACG 


AGGGATTCGT 


CAGGAAAATA 


GGCTTAGCGT 


TGTAAATCCG 


CATTTTCCTG 


3180 



ACGCTACCCC 319 0 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GAGCTCTTCC TTCAACGCAC TTCTGTACCA AGAGTTGTTG TCCATTTGAT CACTAACAAT 60 

AGCTTCCCCT GCTTTCTTCA AGCCCTTTGT CATAAAATCG TTAGATTTTC AT C AT AAAAA 12 0 

TACGAGAAAG ACAACAGGAA GACCGCAAAT TTTCTTTTCT TTTCCTAGGT ACACTGAATG 18 0 

TAACCTTAAA AGAAAAAAGG AAAGGAAGAA AATGATGAAA AAAATTGCCG TTTTATTTGG 240 

AGGGAATTCT CCAGAATACT CAGTGTCACT AACCTCAGCA GCAAGTGTGA TCCAAGCTAT 3 00 

TGACCCGCTG AAATATGAAG TAATGACCAT TGGCATCGCA CCAACAATGG ATTGGTATTG 3 60 



GTATCAAGGA 


AACCTCGCGA 


ATGTTCGCAA 


TGATACTTGG 


CTAGAAGATC 


ACAAAAACTG 


420 


TCACCAGCTG 


ACTTTTTCTA 


GCCAAGGATT 


TATATTAGGA 


GAAAAACGAA 


TCGTCCCTGA 


480 


TGTCCTCTTT 


CCAGTCTTGC 


ATGGGAAGTA 


TGGCGAGGAT 


GGCTGTATCC 


AAGGACTGCT 


540 


TGAACTAATG 


AACCTGCCTT 


ATGTTGGTTG 


CCATGTCGCT 


GCCTCCGCAT 


TATGTATGAA 


600 


CAAATGGCTC 


TTGCATCAAC 


TTGCTGATAC 


CATGGGAATC 


GCTAGTGCTC 


CCACTTTGCT 


660 


TTTATCCCGC 


TATGAAAACG 


ATCCTGCCAC 


AATCGATCGT 


TTTATTCAAG 


AC C ATGG ATT 


720 


CCCGATCTTT 


ATCAAGCCGA 


ATGAAGCCGG 


TTCTTCAAAA 


GGGATCACAA 


AAGTAACTGA 


780 


CAAAACAGCG 


CTCCAATCTG 


CATTAACGAC 


TGCTTTTGCT 


TACGGTTCTA 


CTGTGTTGAT 


840 


CCAAAAGGCG 


ATAGCGGGTA 


TTGAAATTGG 


CTGCGGCATC 


TTAGGAAATG 


AGCAATTGAC 


900 


GATTGGTGCT 


TGTGATGCGA 


TTTCTCTTGT 


CGACGGTTTT 


TTTGATTTTG 


AAGAGAAATA 


960 


CCAATTAATC 


AGCGCCACGA 


TCACTGTCCC 


AGCACCATTG 


CCTCTCGCGC 


TTGAATCACA 


1020 


GATCAAGGAG 


CAGGCACAGC 


TGCTTTATCG 


AAACTTGGGA 


TTGACGGGTC 


TGGCTCGAAT 


1080 


CGATTTTTTC 


GTCACCAATC 


AAGGAGCGAT 


TTATTTAAAC 


GAAATCAACA 


CCATGCCGGG 


1140 


ATTTACTGGG 


CACTCCCGCT 


ACCCAGCTAT 


GATGGCGGAA 


GTCGGGTTAT 


CCTACGAAAT 


1200 


ATTAGTAGAG 


CAATTGATTG 


CACTGGCAGA 


GGAGGACAAA 


CGATGAACAC 


ATTACAATTG 


1260 


ATCAATAAAA 


ACCATCCATT 


GAAAAAAAAT 


CAAGAGCCCC 


CGCACTTAGT 


GCTAGCTCCT 


1320 


TTTAGCGATC 


ACGATGTTTA 


CCTGCAG 








1347 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Glu Lys Leu Arg Val Gly lie Val Gly Gly Gly Lys Ser Ala Glu 
15 10 15 

His Glu Val Ser Leu Gin Ser Ala Lys Asn lie Val Asp Ala lie Asp 
20 25 30 

Lys Ser Arg Phe Asp Val Val Leu Leu Gly lie Asp Lys Gin Gly Gin 
35 40 45 

Trp His Val Ser Asp Ala Ser Asn Tyr Leu Leu Asn Ala Asp Asp Pro 
50 55 60 



Ala His lie Ala Leu Arg Pro Ser Ala Thr Ser Leu Ala Gin Val Pro 
65 70 75 80 

Gly Lys His Glu His Gin Leu lie Asp Ala Gin Asn Gly Gin Pro Leu 

85 90 95 

Pro Thr Val Asp Val lie Phe Pro lie Val His Gly Thr Leu Gly Glu 
100 105 110 

Asp Gly Ser Leu Gin Gly Met Leu Arg Val Ala Asn Leu Pro Phe Val 
115 120 125 

Gly Ser Asp Val Leu Ala Ser Ala Ala Cys Met Asp Lys Asp Val Thr 
130 135 140 

Lys Arg Leu Leu Arg Asp Ala Gly Leu Asn lie Ala Pro Phe lie Thr 
145 150 155 160 

Leu Thr Arg Ala Asn Arg His Asn lie Ser Phe Ala Glu Val Glu Ser 

165 170 175 

Lys Leu Gly Leu Pro Leu Phe Val Lys Pro Ala Asn Gin Gly Ser Ser 
180 185 190 

Val Gly Val Ser Lys Val Thr Ser Glu Glu Gin Tyr Ala Thr Ala Val 
195 200 205 

Ala Leu Ala Phe Glu Phe Asp His Lys Val lie Val Glu Gin Gly lie 
210 215 220 

Lys Gly Arg Glu lie Glu Cys Ala Val Leu Gly Asn Asp Asn Pro Gin 
225 230 235 240 

Ala Ser Thr Cys Gly Glu lie Val Leu Thr Ser Asp Phe Tyr Ala Tyr 

245 250 255 

Asp Thr Lys Tyr lie Asp Glu Asp Gly Ala Lys Val Val Val Pro Ala 
260 265 270 

Ala lie Ala Pro Glu lie Asn Asp Lys lie Arg Ala lie Ala Val Gin 
275 280 285 

Ala Tyr Gin Thr Leu Gly Cys Ala Gly Met Ala Arg Val Asp Val Phe 
290 295 300 

Leu Thr Pro Glu Asn Glu Val Val lie Asn Glu lie Asn Thr Leu Pro 
305 310 315 320 

Gly Phe Thr Asn lie Ser Met Tyr Pro Lys Leu Trp Gin Ala Ser Gly 

325 330 335 

Leu Gly Tyr Thr Asp Leu lie Thr Arg Leu lie Glu Leu Ala Leu Glu 
340 345 350 

Arg His Ala Ala Asn Asn Ala Leu Lys Thr Thr Met 
355 360 




*(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 06 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Thr Asp Lys lie Ala Val Leu Leu Gly Gly Thr Ser Ala Glu Arg 
15 10 15 

Glu Val Ser Leu Asn Ser Gly Ala Ala Val Leu Ala Gly Leu Arg Glu 
20 25 30 

Gly Gly lie Asp Ala Tyr Pro Val Asp Pro Lys Glu Val Asp Val Thr 
35 40 45 

Gin Leu Lys Ser Met Gly Phe Gin Lys Val Phe lie Ala Leu His Gly 
50 55 60 

Arg Gly Gly Glu Asp Gly Thr Leu Gin Gly Met Leu Glu Leu Met Gly 
65 70 75 80 

Leu Pro Tyr Thr Gly Ser Gly Val Met Ala Ser Ala Leu Ser Met Asp 

85 90 95 

Lys Leu Arg Ser Lys Leu Leu Trp Gin Gly Ala Gly Leu Pro Val Ala 
100 105 110 

Pro Trp Val Ala Leu Thr Arg Ala Glu Phe Glu Lys Gly Leu Ser Asp 
115 120 125 

Lys Gin Leu Ala Glu lie Ser Ala Leu Gly Leu Pro Val lie Val Lys 
130 135 140 

Pro Ser Arg Glu Gly Ser Ser Val Gly Met Ser Lys Val Val Ala Glu 
145 150 155 160 

Asn Ala Leu Gin Asp^Ala Leu Arg Leu Ala Phe Gin His Asp Glu Glu 

165 170 175 

Val Leu lie Glu Lys Trp Leu Ser Gly Pro Glu Phe Thr Val Ala lie 
180 185 190 

Leu Gly Glu Glu lie Leu Pro Ser lie Arg lie Gin Pro Ser Gly Thr 
195 200 205 

Phe Tyr Asp Tyr Glu Ala Lys Tyr Leu Ser Asp Glu Thr Gin Tyr Phe 
210 215 220 

Cys Pro Ala Gly Leu Glu Ala Ser Gin Glu Ala Asn Leu Gin Ala Leu 
225 230 235 240 



Val Leu Lys Ala Trp Thr Thr Leu 

245 

Asp Val Met Leu Asp Ser Asp Gly 
260 

Thr Ser Pro Gly Met Thr Ser His 
275 280 

Gin Ala Gly Met Ser Phe Ser Gin 
290 295 

Ala Asp 
305 

INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Gly Cys Lys Gly Trp Gly Arg lie 
250 255 

Gin Phe Tyr Leu Leu Glu Ala Asn 
265 270 

Ser Leu Val Pro Met Ala Ala Arg 

285 

Leu Val Val Arg lie Leu Glu Leu 
300 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Leu Phe Pro Met Val lie 
1 5 

INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Ser Thr Gin Asn Cys 
1 5 

INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

Gly Glu Asp Gly Ser lie Gin Gly 
1 5 

INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

Asn Thr Leu Pro Gly Phe Thr 
1 5 

INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

Gly Glu Asp Gly Thr Leu Gin Gly 
1 5 

INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

Asn Thr Ser Pro Gly Met Thr 
1 5 



INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E. faecium 

(B) STRAIN: BM414 7 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Glu Met Asp Val Met Glu Gin Lys Leu Asn Thr Leu Lys Arg Thr Leu 
15 10 15 

Glu Lys Arg Glu Gin Asp Ala Lys Leu Ala Glu Gin Arg Lys Asn Asp 
20 25 30 

Val Val Met Tyr Leu Ala His Asp lie Lys Thr Pro Leu Thr Ser lie 
35 40 45 

lie Gly Tyr Leu Ser Leu Leu Asp Glu Ala Pro Asp Met Pro Val Asp 
50 55 60 

Gin Lys Ala Lys Tyr Val His lie Thr Leu Asp Lys Ala Tyr Arg Leu 
65 70 75 80 

Glu Gin Leu lie Asp Glu Phe Phe Glu lie Thr Arg Tyr Asn Leu Gin 

85 90 95 

Thr lie Thr Leu Thr Lys Thr His lie Asp Leu Tyr Tyr Met Leu Val 
100 105 110 

Gin Met Thr Asp Glu Phe Tyr Pro Gin Leu Ser Ala His Gly Lys Gin 
115 120 125 

Ala Val lie His Ala Pro Glu Asp Leu Thr Val Ser Gly Asp Pro Asp 
130 135 140 

Lys Leu Ala Arg Val Phe Asn Asn lie Leu Lys Asn Ala Ala Ala Tyr 
145 150 155 160 

Ser Glu Asp Asn Ser lie lie Asp lie Thr Ala Gly Leu Ser Gly Asp 

165 170 175 

Val Val Ser lie Glu Phe Lys Asn Thr Gly Ser lie Pro Lys Asp Lys 
180 185 190 

Leu Ala Ala lie Phe Glu Lys Phe Tyr Arg Leu Asp Asn Ala Arg Ser 
195 200 205 



Ser Asp Thr Gly Gly Ala Gly Leu Gly Leu Ala lie Ala Lys Glu lie 
210 215 220 

lie Val Gin His Gly Gly Gin lie Tyr Ala Glu Ser Asn Asp Asn Tyr 
225 230 235 240 

Thr Thr Phe Arg Val Glu Leu Pro Ala Met Pro Asp Leu Val Asp Lys 

245 250 255 

Arg Arg Ser 



INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 



(xi) SEQUENCE DESCRII 

Glu lie Arg Val Met 
1 5 

Arg Asp Val Thr Gin 
20 

Phe Ala Asn Val Ser 
35 

Gly Tyr Leu Glu Met 
50 

Glu Lys Ala Leu His 

65 

Leu Val Lys Gin Leu 

85 

His Leu Leu Asn Glu 
100 

Glu Arg Glu Ala Gin 
115 

Glu lie Asp Asn Gly 
130 

Ser Ala lie Ser Asn 
145 



TION: SEQ ID NO: 41: 

Pro Tyr Thr His Lys Gin 

10 

Met His Gin Leu Glu Gly 
25 

His Glu Leu Arg Thr Pro 
40 

Met Asn Glu Gin Pro Leu 
55 

Thr Met Arg Glu Gin Thr 
70 75 

Leu Thr Leu Ser Lys lie 

90 

Lys Val Asp Val Pro Met 
105 

Thr Leu Ser Gin Lys Lys 
120 

Leu Lys Val Ser Gly Asn 
135 

Leu Val Tyr Asn Ala Val 
150 155 



Leu Leu Met Val Ala 
15 

Ala Arg Arg Asn Phe 
30 

Leu Thr Val Leu Gin 
45 

Glu Gly Ala Val Arg 
60 

Gin Arg Met Glu Gly 

80 

Glu Ala Ala Pro Thr 

95 

Met Leu Arg Val Val 
110 

Gin Thr Phe Thr Phe 
125 

Glu Asp Gin Leu Arg 
140 

Asn His Thr Pro Glu 

160 



Gly Thr His lie Thr Val Arg Trp Gin Arg Val Pro His Gly Ala Glu 

165 170 175 

Phe Ser Val Glu Asp Asn Gly Pro Gly lie Ala Pro Glu His lie Pro 
180 185 190 

Arg Leu Thr Glu Arg Phe Tyr Arg Val Asp Lys Ala Arg Ser Arg Gin 
195 200 205 

Thr Gly Gly Ser Gly Leu Gly Leu Ala lie Val Lys His Ala Val Asn 
210 215 220 

His His Glu Ser Arg Leu Asn lie Glu Ser Thr Val Gly Lys Gly Thr 
225 230 235 240 

Arg Phe Ser Phe Val lie Pro Glu Arg Leu lie Ala Lys Asn Ser Asp 

245 250 255 



INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Ala Ser Glu Val Arg Ser Val Thr Arg Ala Phe Asn His Met Ala Ala 
15 10 15 

Gly Val Lys Gin Leu Ala Asp Asp Arg Thr Leu Leu Met Ala Gly Val 
20 25 30 

Ser His Asp Leu Arg Thr Pro Leu Thr Arg lie Arg Leu Ala Thr Glu 
35 40 45 

Met Met Ser Glu Gin Asp Gly Tyr Leu Ala Glu Ser lie Asn Lys Asp 
50 55 60 

lie Glu Glu Cys Asn Ala lie lie Glu Gin Phe lie Asp Tyr Leu Arg 
65 70 75 80 

Thr Gly Gin Glu Met Pro Met Glu Met Ala Asp Leu Asn Ala Val Leu 

85 90 95 

Gly Glu Val lie Ala Ala Glu Ser Gly Tyr Glu Arg Glu lie Glu Thr 
100 105 110 



Ala Leu Tyr Pro Gly Ser lie Glu Val Lys Met His Pro Leu Ser lie 
115 120 125 

Lys Arg Ala Val Ala Asn Met Val Val Asn Ala Ala Arg Tyr Gly Asn 
130 135 140 

Gly Trp lie Lys Val Ser Ser Gly Thr Glu Pro Asn Arg Ala Trp Phe 
145 150 155 160 

Gin Val Glu Asp Asp Gly Pro Gly lie Ala Pro Glu Gin Arg Lys His 

165 170 175 

Leu Phe Gin Pro Phe Val Arg Gly Asp Ser Ala Arg Thr lie Ser Gly 
180 185 190 

Thr Gly Leu Gly Leu Ala lie Val Gin Arg lie Val Asp Asn His Asn 
195 200 205 

Gly Met Leu Glu Leu Gly Thr Ser Glu Arg Gly Gly Leu Ser lie Arg 
210 215 220 

Ala Trp Leu Pro Val Pro Val Thr Arg Ala Gin Gly Thr Thr Lys Glu 
225 230 235 240 

Gly 

INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E. faecium 

(B) STRAIN: BM4147 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Ser Asp Lys lie Leu lie Val Asp Asp Glu His Glu lie Ala Asp 
1 5 10 15 

Leu Val Glu Leu Tyr Leu Lys Asn Glu Asn Tyr Thr Val Phe Lys Tyr 
20 25 30 

Tyr Thr Ala Lys Glu Ala Leu Glu Cys lie Asp Lys Ser Glu lie Asp 
35 40 45 

Leu Ala lie Leu Asp lie Met Leu Pro Gly Thr Ser Gly Leu Thr lie 
50 55 60 

Cys Gin Lys lie Arg Asp Lys His Thr Tyr Pro lie lie Met Leu Thr 
65 70 75 80 



Gly Lys Asp Thr Glu Val Asp Lys lie Thr Gly Leu Thr lie Gly Ala 

85 90 95 

Asp Asp Tyr lie Thr Lys Pro Phe Arg Pro Leu Glu Leu lie Ala Arg 
100 105 110 

Val Lys Ala Gin Leu Arg Arg Tyr Lys Lys Phe Ser Gly Val Lys Glu 
115 120 125 

Gin Asn Glu Asn Val lie Val His Ser Gly Leu Val lie Asn Val Asn 
130 135 140 

Thr His Glu Cys Tyr Leu Asn Glu Lys Gin Leu Ser Leu Thr Pro Thr 
145 150 155 160 

Glu Phe Ser lie Leu Arg lie Leu Cys Glu Asn Lys Gly Asn Val Val 

165 170 175 

Ser Ser Glu Leu Leu Phe His Glu lie Trp Gly Asp Glu Tyr Phe Ser 
180 185 190 

Lys Ser Asn Asn Thr lie Thr Val His lie Arg His Leu Arg Glu Lys 
195 200 205 

Met Asn Asp Thr lie Asp Asn Pro Lys Tyr lie Lys Thr Val Trp Gly 
210 215 220 

Val Gly Tyr Lys lie Glu Lys 
225 230 

INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Gin Glu Asn Tyr Lys lie Leu Val Val Asp Asp Asp Met Arg Leu 
15 10 15 

Arg Ala Leu Leu Glu Arg Tyr Leu Thr Glu Gin Gly Phe Gin Val Arg 
20 25 30 

Ser Val Ala Asn Ala Glu Gin Met Asp Arg Leu Leu Thr Arg Glu Ser 
35 40 45 

Phe His Leu Met Val Leu Asp Leu Met Leu Pro Gly Glu Asp Gly Leu 
50 55 60 



Ser lie Cys Arg Arg Leu Arg Ser Gin Ser Asn Pro Met Pro lie lie 
65 70 75 80 

Met Val Thr Ala Lys Gly Glu Glu Val Asp Arg lie Val Gly Leu Glu 

85 90 95 

lie Gly Ala Asp Asp Tyr lie Pro Lys Pro Phe Asn Pro Arg Glu Leu 
100 105 110 

Leu Ala Arg lie Arg Ala Val Leu Arg Arg Gin Ala Asn Glu Leu Pro 
115 120 125 

Gly Ala Pro Ser Gin Glu Glu Ala Val lie Ala Phe Gly Lys Phe Lys 
130 135 140 

Leu Asn Leu Gly Thr Arg Glu Met Phe Arg Glu Asp Glu Pro Met Pro 
145 150 155 160 

Leu Thr Ser Gly Glu Phe Ala Val Leu Lys Ala Leu Val Ser His Pro 

165 170 175 

Arg Glu Pro lie Ser Arg Asp Lys Leu Met Asn Leu Ala Arg Gly Arg 
180 185 190 

Glu Tyr Ser Ala Met Glu Arg Ser lie Asp Val Gin lie Ser Arg Leu 
195 200 205 

Arg Arg Met Val Glu Glu Asp Pro Ala His Pro Arg Tyr lie Gin Thr 
210 215 220 

Val Trp Gly Leu Gly Tyr Val Phe Val Pro Asp Gly Ser Lys Ala 
225 230 235 

INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Ala Arg Arg lie Leu Val Val Glu Asp Glu Ala Pro lie Arg Glu 
15 10 15 

Met Val Cys Phe Val Leu Glu Gin Asn Gly Phe Gin Pro Val Glu Ala 
20 25 30 

Glu Asp Tyr Asp Ser Ala Val Asn Gin Leu Asn Glu Pro Trp Pro Asp 
35 40 45 



Leu lie Leu Leu Asp Trp Met Leu Pro Gly Gly Ser Gly lie Gin Phe 
50 55 60 

lie Lys His Leu Lys Arg Glu Ser Met Thr Arg Asp lie Pro Val Val 
65 70 75 80 

Met Leu Thr Ala Arg Gly Glu Glu Glu Asp Arg Val Arg Gly Leu Glu 

85 90 95 

Thr Gly Ala Asp Asp Tyr lie Thr Lys Pro Phe Ser Pro Lys Glu Leu 
100 105 110 

Val Ala Arg lie Lys Ala Val Met Arg Arg lie Ser Pro Met Ala Val 
115 120 125 

Glu Glu Val lie Glu Met Gin Gly Leu Ser Leu Asp Pro Thr Ser His 
130 135 -140 

Arg Val Met Ala Gly Glu Glu Pro Leu Glu Met Gly Pro Thr Glu Phe 
145 150 155 160 

Lys Leu Leu His Phe Phe Met Thr His Pro Glu Arg Val Tyr Ser Arg 

165 170 175 

Glu Gin Leu Leu Asn His Val Trp Gly Thr Asn Val Tyr Val Glu Asp 
180 185 190 

Arg Thr Val Asp Val His lie Arg Arg Leu Arg Lys Ala Leu Glu Pro 
195 200 205 

Gly Gly His Asp Arg Met Val Gin Thr Val Arg Gly Thr Gly Tyr Arg 
210 215 220 

Phe Ser Thr Arg Phe 
225 

INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Ala Asp Lys Glu Leu Lys Phe Leu Val Val Asp Asp Phe Ser Thr 
1 5 10 15 

Met Arg Arg lie Val Arg Asn Leu Leu Lys Glu Leu Cys Phe Asn Asn 
20 25 30 



Val Glu Glu Ala Glu Asp Gly Val Asp Ala Leu Asn Lys Leu Gin Ala 
35 40 45 

Gly Gly Phe Gly Phe lie lie Ser Asp Trp Asn Met Pro Asn Met Asp 
50 55 60 

Gly Leu Glu Leu Leu Lys Thr lie Arg Ala Asp Ser Ala Met Ser Ala 
65 70 75 80 

Leu Pro Val Leu Met Val Thr Ala Glu Ala Lys Lys Glu Asn lie lie 

85 90 95 

Ala Ala Ala Gin Ala Gly Ala Ser Gly Tyr Val Val Lys Pro Phe Thr 
100 105 110 

Ala Ala Thr Leu Glu Glu Lys Leu Asn Lys lie Phe Glu Lys Leu Gly 
115 120 125 

Met 

INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E. faecium 

(B) STRAIN: BM414 7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Asn Arg lie Lys Val Ala lie Leu 
1 5 

INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: RBS 

(B) LOCATION: 1..10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
TGAAAGGAGA 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
UCUUUCCUCC 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GCTGCAGATA AAAATTTAGG AGG 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
CGCATGCTAT TATAAAAGCC AGTC 



+ 



10 



10 
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y)) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TGAAAGGGTG 10 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: rRNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: B. subtilis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGGGGGTTGG NNNNNNNNTT G 21 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: rRNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: B. subtilis 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
AGAACGAAAA NNNNNNATG 
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